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00 (54) Title: GENES ESSENTIAL FOR MICROBIAL PROLIFERATION AND ANTISENSE THERETO 
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(57) Abstract: The sequences of nucleic acids encoding proteins required forE. coli proliferation are disclosed. The nucleic acids 
^ can be used to express proteins or portions thereof, to obtain antibodies capable of specifically binding to the expressed proteins, and 

to use those expressed proteins as a screen to isolate candidate molecules for rational drug discovery programs. The nucleic acids 
Q can also be used to screen for homologous genes that are required for proliferation in microorganisms other than E. coli. The nucleic 

acids can also be used to design expression vectors and secretion vectors. The nucleic acids of the present invention can also be used 
^ in various assay systems to screen for proliferation required genes in other organisms as well as to screen for antimicrobial agents. 
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GENES ESSENTIAL FOR MICROBIAL PROLIFERATION AND ANTKENSE THERETO 

BACKGROUND OF THE INVENTION 
Since the discovery of penicillin, the use of antibiotics to treat the ravages of bacterial 
5 infections has saved millions of lives. With the advent of these "miracle drugs/' for a time it was 
popularly believed that humanity might, once and for all, be saved from the scourge of bacterial 
infections. In fact, during the 1980s and early 1990s, many large pharmaceutical companies cut 
back or eliminated antibiotics research and development. They believed that infectious disease 
caused by bacteria finally had been conquered and that markets for new drugs were limited. 

1 0 Unfortunately, this belief was overly optimistic. 

The tide is beginning to turn in favor of the bacteria as reports of drug resistant bacteria 
become more frequent. The United States Centers for Disease Control announced that one of the most 
powerful known antibiotics, vancomycin, was unable to treat an infection of the common 
Staphylococcus aureus (staph). This organism is commonly found in our environment and is 

15 responsible for many nosocomial infections. The import of this announcement becomes clear when 
one considers that vancomycin was used for years to treat infections caused by stubborn strains of 
bacteria, like staph. In short, the bacteria are becoming resistant to our most powerful antibiotics. 
If this trend continues, it is conceivable that we will return to a time when what are presently 
considered minor bacterial infections are fatal diseases. 

20 There are a number of causes for the predicament in which practitioners of medical arts find 

themselves. Over-prescription and improper prescription habits by some physicians have caused an 
indiscriminate increase in the availability of antibiotics to the public. The patient is also partly 
responsible, for even in instances where an antibiotic is the appropriate treatment, patients will often 
improperly use the drug, the result being yet another population of bacteria that is resistant, in whole or 

25 in part, to traditional antibiotics. 

The bacterial scourges that have haunted humanity remain, in spite of the development of 
modern scientific practices to deal with the diseases that they cause. Drug resistant bacteria are now 
advancing on the health of humanity. A new generation of antibiotics to once again deal with the 
pending health threat that bacteria present is required. 

30 Discovery of New Antibiotics 

As more and more bacterial strains become resistant to the panel of available antibiotics, 
new compounds are required. In the past, practitioners of pharmacology would have to rely upon 
traditional methods of drug discovery to generate novel, safe and efficacious compounds for the 
treatment of disease. Traditional drug discovery methods involve blindly testing potential drug 

35 candidate-molecules, often selected at random, in the hope that one might prove to be an effective 
treatment for some disease. The process is painstaking and laborious, with no guarantee, of success. 
Today, the average cost to discover and develop a new drug is nearly US $500 million, and the 
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average time is 15 years from laboratory to patient Improving this process, even incrementally, 
would represent a huge advance in the generation of novel antimicrobial agents. 

Newly emerging practices in drug discovery utilize a number of biochemical techniques to 
provide for directed approaches to creating new drugs, rather than discovering them at random. For 
5 example, gene sequences and proteins encoded thereby that are required for the proliferation of an 
organism make for excellent targets since exposure of bacteria to compounds active against these 
targets would result in the inactivation of the organism. Once a target is identified, biochemical 
analysis of that target can be used to discover or to design molecules that interact with and alter the 
functions of the target Using physical and computational techniques, to analyze structural and 

10 biochemical targets in order to derive compounds that interact with a target is called rational drug 
design and offers great future potential. Thus, emerging drug discovery practices use molecular 
modeling techniques, combinatorial chemistry approaches, and other means to produce and screen 
and/or design large numbers of candidate compounds. 

Nevertheless, while this approach to drug discovery is clearly the way of the future, 

1 5 problems remain. For example, the initial step of identifying molecular targets for investigation can 
be an extremely time consuming task. It may also be difficult to design molecules that interact with 
the target by using computer modeling techniques. Furthermore, in cases where the function of the 
target is not known or is poorly understood, it may be difficult to design assays to detect molecules 
that interact with and alter the functions of the target. To improve the rate of novel drug discovery 

20 and development, methods of identifying important molecular targets in pathogenic 
microorganisms and methods for identifying molecules that interact with and alter the functions of 
such molecular targets are urgently required. 

Escherichia coli represents an excellent model system to understand bacterial biochemistry 
and physiology. The estimated 4288 genes scattered along the 4.6 x 10 6 base pairs of the 

25 Escherichia coli (£. coli) chromosome offer tremendous promise for the understanding of bacterial 
biochemical processes. In turn, this knowledge will assist in the development of new tools for the 
diagnosis and treatment of bacteria-caused human disease. The entire E. coli genome has been 
sequenced, and this body of information holds a tremendous potential for application to the 
discovery and development of new antibiotic compounds. Yet, in spite of this accomplishment, the 

30 general functions or roles of many of these genes are still unknown. For example, the total number 
of proliferation-required genes contained within the E. coli genome is unknown, but has been 
variously estimated at around 200 to 700 (Armstrong, K.A. and Fan, D.P. Essential Genes in the 
metB-malB Region of Escherichia coli K12, 1975, J. Bacteriol. 126: 48-55). 

Novel, safe and effective antimicrobial compounds are needed in view of the rapid rise of 

35 antibiotic resistant microorganisms. However, prior to this invention, the characterization of even a 
single bacterial gene was a painstaking process, requiring years of effort. Accordingly, there is an 
urgent need for more novel methods to identify and characterize bacterial genomic sequences that 
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encode gene products required for proliferation and for methods to identify molecules that interact 
with and alter the functions of such genes and gene products. 

SUMMARY OF THE INVENTION 
One embodiment of the present invention is a purified or isolated nucleic acid sequence 
5 consisting essentially of one of SEQ ID NOs: 1-127, wherein expression of said nucleic acid 
inhibits proliferation of a microorganism. The nucleic acid sequence may be complementary to at 
least a portion of a coding sequence of a gene whose expression is required for proliferation of a 
microorganism. The nucleic acid sequence may be complementary to at least a portion of an RNA 
required for proliferation of a microorganism. The RNA may be an RNA encoding more than one 
10 gene product. 

Another embodiment of the present invention is a nucleic acid comprising a fragment of 
one of SEQ ID NOs.: 1-127, said fragment selected from the group consisting of fragments 
comprising at least 10, at least 20, at least 25, at least 30, at least 50 and more than 50 consecutive 
bases of one of SEQ ID NOs: 1-127. 

15 Another embodiment of the present invention is a vector comprising a promoter operably 

linked to the nucleic acid sequences of each of the preceding paragraphs. The promoter may be 
active in a microorganism selected from the group consisting of Aspergillus fumigatus, Bacillus 
anthracis, Campylobacter jejuni, Candida albicans, Chlamydia pneumoniae, Chlamydia 
trachomatus, Clostridium botulinum, Cryptococcus neoformans, Enterobacter cloacae, 

20 Enterococcus faecalis, Escherichia coli, Haemophilus influenzae, Helicobacter pylori, Klebsiella 
pneumoniae, Mycobacterium leprae, Mycobacterium tuberculosis, Neisseria gonorrhoeae, 
Pseudomonas aeruginosa, Saccharomyces cerevisae, Salmonella cholerasuis, Salmonella 
paratyphi, Salmonella typhi, Salmonella typhimurium, Staphylococcus aureus, Staphylococcus 
epidermidis, Streptococcus pneumoniae, Treponema pallidum, and Yersinia pestis or any species 

25 falling within the genera of any of the above species. 

Another embodiment of the present invention is a host cell containing the vectors of the 
preceding paragraph. 

Another embodiment of the present invention is a purified or isolated nucleic acid 
consisting essentially of the coding sequence of one of SEQ ID NOs: 1 28-298. 

30 Another embodiment of the present invention is a fragment of the nucleic acid of the 

preceding paragraph, said fragment comprising at least 10, at least 20, at least 25, at least 30, at 
least 50 or more than 50 consecutive bases of one of SEQ ID NOs: 128-298. 

Another embodiment of the present invention is a vector comprising a promoter operably 
linked to the nucleic acid of the preceding two paragraphs. 

35 Another embodiment of the present invention is a purified or isolated antisense nucleic acid 

comprising a nucleic acid sequence complementary to at least a portion of an intragenic sequence, 
intergenic sequence, sequences spanning at least a portion of two or more genes, 5' noncoding 
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region, or 3* noncoding region within an operon comprising a proliferation-required gene whose 
activity or expression is inhibited by an antisense nucleic acid comprising one of SEQ ID NOs.: 1- 
127. 

Another embodiment of the present invention is a purified or isolated nucleic acid 
5 comprising a nucleic acid having at least 70% identity to a sequence selected from the group 
consisting of SEQ ID NOs.: 1-127, fragments comprising at least 25 consecutive nucleotides of 
SEQ ID NOs.: 1-127, the sequences complementary to SEQ ID NOs.: 1-127 and die sequences 
complementary to fragments comprising at least 25 consecutive nucleotides of SEQ ID NOs.: 1-127 
as determined using BLASTN version 2.0 with the default parameters. The nucleic acid may be 
10 from an organism selected from the group consisting of Aspergillus Jumigatus, Bacillus anthracis, 
Campylobacter jejuni, Candida albicans, Chlamydia pneumoniae, Chlamydia trachomatus, 
Clostridium botulinum, Cryptococcus neoformans, Enterobacter cloacae, Enterococcus faecalis, 
Staphylococcus aureus, Haemophilus influenzae, Helicobacter pylori, Klebsiella pneumoniae, 
Mycobacterium leprae, Mycobacterium tuberculosis, Neisseria gonorrhoeae, Pseudomonas 
15 aeruginosa, Saccharomyces cerevisae, Salmonella cholerasuis, Salmonella paratyphi, Salmonella 
typhi, Salmonella typhimurium, Staphylococcus aureus, Staphylococcus epidermidis. Streptococcus 
pneumoniae, Treponema pallidum, and Yersinia pestis or any species failing within the genera of 
any of the above species. 

Another embodiment of the present invention is a vector comprising a promoter operably 
20 linked to a nucleic acid encoding a polypeptide whose expression is inhibited by an antisense 
nucleic acid comprising one of SEQ ID NOs.: 1-127. The polypeptide may comprise a polypeptide 
comprising a sequence selected from the group consisting of SEQ ID NOs: 299-469. 

Another embodiment of the present invention is a host cell containing the vector of the 
preceding paragraph. 

25 Another embodiment of the present invention is a purified or isolated polypeptide 

comprising a polypeptide whose expression is inhibited by an antisense nucleic acid comprising one 
of SEQ ID NOs.: 1-127, or a fragment selected from the group consisting of fragments comprising 
at (east 5, at least 10, at least 20, at least 30, at least 40, at least 50, at least 60 or more than 60 
consecutive amino acids of one of the said polypeptides. The polypeptide may comprise a 

30 polypeptide comprising one of SEQ ID NOs.: 299-469 or a fragment comprising at least 5, at least 
10, at least 20, at least 30, at least 40, at least 50, at least 60 or more than 60 consecutive amino 
acids of a polypeptide comprising a sequence selected from the group consisting of SEQ ID NOs.: 
299-469. 

Another embodiment of the present invention is a purified or isolated polypeptide 
35 comprising a polypeptide having at least 25% identity to a polypeptide whose expression is 
inhibited by a sequence selected from the group consisting of SEQ ID NOs.: 1-127, or at least 25% 
identity to a fragment comprising at least 5, at least 10, at least 20, at least 30, at least 40, at least 
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50, at least 60 or more than 60 consecutive amino acids of a polypeptide whose expression is 
inhibited by a nucleic acid selected from the group consisting of SEQ ID NOs.: 1-127 as 
determined using FASTA version 3.0t78 with the default parameters. The polypeptide may have at 
least 25% identity to a polypeptide comprising one of SEQ ID NOs: 299-469 or at least 25% 
5 identity to a fragment comprising at least 5, at least 10, at least 20, at least 30, at least 40, at least 
50, at least 60 or more than 60 consecutive amino acids of a polypeptide comprising one of SEQ ID 
NOs.: 299-469 as determined using FASTA version 3.0t78 with the default parameters. 

Another embodiment of the present invention is an antibody capable of specifically binding 
one of the polypeptides of the preceding paragraph. 

10 Another embodiment of the present invention is a method of producing a polypeptide, 

comprising introducing a vector comprising a promoter operably linked to a nucleic acid encoding a 
polypeptide whose expression is inhibited by an antisense nucleic acid comprising one of SEQ ID 
NOs.: 1-127 into a cell. The method may further comprise the step of isolating said polypeptide. 
The polypeptide may comprise a sequence selected from the group consisting of SEQ ID NOs.: 

15 299-469. 

Another embodiment of the present invention is a method of inhibiting proliferation of a 
microorganism comprising inhibiting the activity or reducing the amount of a gene product whose 
expression is inhibited by an antisense nucleic acid comprising a sequence selected from the group 
consisting of SEQ ID NOs.: 1-127 or inhibiting the activity or reducing the amount of a nucleic acid 

20 encoding said gene product. The gene product may comprise a polypeptide comprising a sequence 
selected from the group consisting of SEQ ID NOs.: 299-469. 

Another embodiment of the present invention is a method for identifying a compound 
which influences the activity of a gene product required for proliferation, said gene product 
comprising a gene product whose expression is inhibited by an antisense nucleic acid comprising a 

25 sequence selected from the group consisting of SEQ ID NOs.: 1-127, said method comprising 
contacting said gene product with a candidate compound and determining whether said compound 
influences the activity of said gene product. The gene product may be a polypeptide and said 
activity may be an enzymatic activity. The gene product may be a polypeptide and said activity 
may be a carbon compound catabolism activity. The gene product may be a polypeptide and said 

30 activity may be a biosynthetic activity. The gene product may be a polypeptide and said activity 
may be a transporter activity. The gene product may be a polypeptide and said activity may be a 
transcriptional activity. The gene product may be a polypeptide and said activity may be a DNA 
replication activity. The gene product may be a polypeptide and said activity my be a cell division 
activity. The gene product may be a polypeptide comprising a sequence selected from the group 

35 consisting of SEQ ID NOs.: 299-469. 

Another embodiment of the present invention is a compound identified using the methods 
of the preceding paragraph. 
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A The method of Claim 28, wherein said gene product is a polypeptide comprising a 
sequence selected from the group consisting of SEQ ID NOs.: 299-469. 

method for identifying a compound or nucleic acid having the ability to reduce the activity or level 
of a gene product required for proliferation, said gene product comprising a gene product whose 
5 activity or expression is inhibited by an antisense nucleic acid comprising a sequence selected from 
the group consisting of SEQ ID NOs.: 1-127, said method comprising: 

(a) providing a target that is a gene or RNA, wherein said target comprises a nucleic 
acid encoding said gene product; 

(b) contacting said target with a candidate compound or nucleic acid; and 
10 (c) measuring an activity of said target. 

The target may be a messenger RNA molecule and said activity may be translation of said 
messenger RNA. The target may be a messenger RNA molecule and said activity may be 
transcription of a gene encoding said messenger RNA. The target may be a gene and said activity 
may be transcription of said gene. The target may be a nontranslated RNA and said activity may be 

15 processing or folding of said nontranslated RNA or assembly of said nontranslated RNA into a 
protein/RNA complex. The target gene or RNA may encode a polypeptide comprising a sequence 
selected from the group consisting of SEQ ID NOs.: 299-469. 

Another embodiment of the present invention is a compound or nucleic acid identified 
using the methods of the preceding paragraph. 

20 Another embodiment of the present invention is a method for identifying a compound 

which reduces the activity or level of a gene product required for proliferation of a microorganism, 
wherein the activity or expression of said gene product is inhibited by an antisense nucleic acid 
comprising a sequence selected from the group consisting of SEQ ID NOs.: 1-127, said method 
comprising the steps of: 

25 (a) expressing a sublethal level of an antisense nucleic acid complementary to a 

nucleic acid encoding said gene product in a cell to reduce the activity or amount of said gene 
product in said cell, thereby producing a sensitized cell; 

(b) contacting said sensitized cell with a compound; and 

(c) determining whether said compound inhibits the growth of said sensitized cell. 

30 The determining step may comprise determining whether said compound inhibits the growth of said 
sensitized cell to a greater extent than said compound inhibits the growth of a nonsensitized cell. 
The cell may be selected from the group consisting of bacterial cells, fungal cells, plant cells, and 
animal cells. The cell may be a Gram negative bacterium. The cell may be an E. coli cell. The cell 
may be from an organism selected from the group consisting of Aspergillus fumigatus, Bacillus 

35 anthracis, Campylobacter jejuni, Candida albicans, Chlamydia pneumoniae, Chlamydia 
trachomatus, Clostridium botulinum, Cryptococcus neoformans, Enterohacter cloacae, 
Enterococcus faecalis, Haemophilus influenzae, Helicobacter pylori, Klebsiella pneumoniae, 
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Mycobacterium leprae, Mycobacterium tuberculosis, Neisseria gonorrhoeae, Pseudomonas 
aeruginosa, Saccharomyces cerevisae, Salmonella cholerasuis, Salmonella paratyphi, Salmonella 
typhi, Salmonella typhimurium, Staphylococcus aureus, Staphylococcus epidermidis, Streptococcus 
pneumoniae, Treponema pallidum, and Yersinia pestis or any species falling within the genera of 
5 any of the above species. The antisense nucleic acid may be transcribed from an inducible 
promoter. The method may further comprise the step of contacting said cell with a concentration of 
inducer which induces said antisense nucleic acid to a sublethal level. Growth inhibition may be 
measured by monitoring optical density of a culture growth solution. The gene product may be a 
polypeptide. The polypeptide may comprise a sequence selected from the group consisting of SEQ 

10 ID NOs.: 299-469. The gene product may be an RNA. 

Another embodiment of the present invention is a compound identified using the methods 
of the preceding paragraph. 

Another embodiment of the present invention is a method for inhibiting cellular 
proliferation comprising introducing a compound with activity against a gene whose activity or 

15 expression is inhibited by an antisense nucleic acid comprising a sequence selected from the group 
consisting of SEQ ID NOs.: 1-127 or a compound with activity against the product of said gene into 
a population of cells expressing said gene. The compound may be an antisense nucleic acid 
comprising a sequence selected from the group consisting of SEQ ID NOs.: 1-127, or a 
proliferation-inhibiting portion thereof. The proliferation inhibiting portion of one of SEQ ID 

20 NOs.: 1-127 may be a fragment comprising at least 10, at least 20, at least 25, at least 30, at least 50 
or more than 51 consecutive bases of one of SEQ ID NOs.: 1-127. The population may be a 
population selected from the group consisting of bacterial cells, fungal cells, plant cells, and animal 
cells. The population may be a population of Gram negative bacteria. The population may be a 
population of K coli cells. The population may be a population selected from the group consisting 

25 of Aspergillus fumigatus, Bacillus anthracis, Campylobacter jejuni, Candida albicans, Chlamydia 
pneumoniae, Chlamydia trachomatus, Clostridium botulinum, Cryptococcus neoformans, 
Enterobacter cloacae, Enterococcus faecalis, Haemophilus influenzae, Helicobacter pylori, 
Klebsiella pneumoniae, Mycobacterium leprae, Mycobacterium tuberculosis, Neisseria 
gonorrhoeae, Pseudomonas aeruginosa,Saccharomyces cerevisae, Salmonella cholerasuis, 

30 Salmonella paratyphi, Salmonella typhi. Salmonella typhimurium, Staphylococcus aureus, 
Staphylococcus epidermidis, Streptococcus pneumoniae, Treponema pallidum, and Yersinia pestis 
cells or cells from any species falling within the genera of any of the above species. The gene may 
encode a polypeptide comprising a sequence selected from the group consisting of SEQ ID NOs.: 
299-469. 

35 Another embodiment of the present invention is a preparation comprising an effective 

concentration of an antisense nucleic acid comprising a sequence selected from the group consisting 
of SEQ ID NOs.: 1-127, or a proliferation-inhibiting portion thereof in a pharmaceutically 

-7- 
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acceptable carrier. The proliferation- inhibiting portion of one of SEQ ID NOs.: 1 T 127 may 
comprise at least 10, at least 20, at least 25, at least 30, at least 50 or more than 50 consecutive 
bases of one of SEQ ID NOs.: 1-127. 

Another embodiment of the present invention is a method for inhibiting the activity or 
5 expression of a gene in an operon required for proliferation wherein the activity or expression of at 
least one gene in said operon is inhibited by an antisense nucleic acid comprising a sequence 
selected from the group consisting of SEQ ID NOs.: 1-127, said method comprising contacting a 
cell in a cell population with an antisense nucleic acid comprising at least a proliferation-inhibiting 
portion of said operon. The antisense nucleic acid comprises a sequence selected from the group 

10 consisting of SEQ ID NOs.: 1-127 or a proliferation inhibiting portion thereof. 

The method of Claim 68, wherein said cell is contacted with said antisense nucleic acid by 
introducing a plasmid which expresses said antisense nucleic acid into said cell population. The 
cell may be contacted with said antisense nucleic acid by introducing a phage which expresses said 
antisense nucleic acid into said cell population. The cell may be contacted with said antisense 

15 nucleic acid by expressing said antisense nucleic acid from the chromosome of cells in said cell 
population. The cell may be contacted with said antisense nucleic acid by introducing a promoter 
adjacent to a chromosomal copy of said antisense nucleic acid such that said promoter directs the 
synthesis of said antisense nucleic acid. The cell may be contacted with said antisense nucleic acid 
by introducing a retron which expresses said antisense nucleic acid into said cell population. The 

20 cell may be contacted with said antisense nucleic acid by introducing a ribozyme into said cell- 
population, wherein a binding portion of said ribozyme is complementary to said antisense 
oligonucleotide. The cell may be contacted with said antisense nucleic acid by introducing a 
liposome comprising said antisense oligonucleotide into said cell. The cell may be contacted with 
said antisense nucleic acid by electroporation of said antisense nucleic acid. The antisense nucleic 

25 acid may be a fragment comprising at least 10, at least 20, at least 25, at least 30, at least 50 or more 
than 50 consecutive bases of one of SEQ ID NOs.: 1-127. The antisense nucleic acid may be an 
oligonucleotide. 

Another embodiment of the present invention is a method for identifying a gene which is 
required for proliferation of a microorganism comprising: 
30 (a) contacting a microorganism other than E. coli with a nucleic acid selected from the 

group consisting of SEQ ID NOs.: 1-127; 

(b) determining whether said nucleic acid inhibits proliferation of said microorganism; 

and 

(c) identifying the gene in said microorganism which is inhibited by said nucleic acid. 
35 The microorganism may be a Gram negative bacterium. The microorganism may be selected from 

the group consisting of Aspergillus Jumigatus, Bacillus anthracis, Campylobacter jejuni, Candida 
albicans, Chlamydia pneumoniae, Chlamydia trachomatus, Clostridium botulinum, Cryptococcus 
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neoformans, Enterobacter cloacae, Enterococcus faecalis, Haemophilus influenzae, Helicobacter 
pylori, Klebsiella pneumoniae, Mycobacterium leprae, Mycobacterium tuberculosis, Neisseria 
gonorrhoeae, Pseudomonas aeruginosa, Saccharomyces cerevisae, Salmonella cholerasuis, 
Salmonella paratyphi, Salmonella typhi, Salmonella typhimurium, Staphylococcus aureus, 
5 Staphylococcus epidermidis, Streptococcus pneumoniae, Treponema pallidum, and Yersinia pestis 
or any species falling within the genera of any of the above species. The method may further 
comprise introducing said nucleic acid into a vector functional in said microorganism prior to 
introducing said inhibitory nucleic acid into said microorganism. 

Another embodiment of the present invention is a method for identifying a compound 

10 having the ability to inhibit proliferation of a microorganism comprising: 

(a) identifying in a first microorganism a homolog of a gene or gene product present in 
a second microorganism which is different than said first microorganism, wherein the activity or 
level of said gene or gene product is inhibited by a nucleic acid comprising a sequence selected 
from the group consisting of SEQ ID NOs. 1- 127 ; 

15 (b) identifying an inhibitory nucleic acid sequence which inhibits the activity of said 

homolog in said first microorganism; 

(c) contacting said first microorganism with a sublethal level of said inhibitory nucleic 
acid, thus sensitizing said first microorganism; 

(d) contacting the sensitized microorganism of step (c) with a compound; and 

20 (e) determining whether said compound inhibits proliferation of said sensitized 

microorganism. 

The determining step may comprise determining whether said compound inhibits proliferation of 
said sensitized microorganism to a greater extent than said compound inhibits proliferation of a 
nonsensitized microorganism. Step (a) may comprise identifying a homologous nucleic acid to a 

25 gene or gene product whose activity or level is inhibited by a nucleic acid selected from the group 
consisting of SEQ ID NOs. 1-127 or a nucleic acid encoding a homologous polypeptide to a 
polypeptide whose activity or level is inhibited by a nucleic acid selected from the group consisting 
of SEQ ID NOs. 1-127 by using an algorithm selected from the group consisting of BLASTN 
version 2.0 with the default parameters and FASTA version 3.0t78 algorithm with the default 

30 parameters to identify said homologous nucleic acid or said nucleic acid encoding a homologous 
polypeptide in a database. Step (a) may comprise identifying a homologous nucleic acid or a 
nucleic acid encoding a homologous polypeptide by identifying nucleic acids which hybridize to 
said first gene. Step (a) may comprise expressing a nucleic acid selected from the group consisting 
of SEQ ID NOs. 1-127 in said microorganism. The inhibitory nucleic acid may be an antisense 

35 nucleic acid. The inhibitory nucleic acid may comprise an antisense nucleic acid to a portion of 
said homolog. The inhibitory nucleic acid may comprise an antisense nucleic acid to a portion of 
the operon encoding said homolog. The step of contacting the first microorganism with a sublethal 
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level of said inhibitory nucleic acid may comprise directly contacting said microorganism with said 
inhibitory nucleic acid. The step of contacting the first microorganism with a sublethal level of said 
inhibitory nucleic acid may comprise expressing an antisense nucleic acid to said homolog in said 
microorganism. The gene product may comprise a polypeptide comprising a sequence selected 
5 from the group consisting of SEQ ID NOs.: 299-469. 

Another embodiment of the present invention is a compound identified using the method of 
the preceding paragraph. 

Another embodiment of the present invention is a method of identifying a compound 
having the ability to inhibit proliferation comprising: 
10 (a) contacting a microorganism other than E. coli with a sublethal level of a nucleic 

acid comprising a sequence selected from the group consisting of SEQ ID NOs. 1-127 or a portion 
thereof which inhibits the proliferation of E. coli, thus sensitizing said microorganism; 

(b) contacting the sensitized m icroorganism of step (a) with a compound; and 

(c) determining whether said compound inhibits proliferation of said sensitized 
15 microorganism. 

The determining step may comprise determining whether said compound inhibits proliferation of 
said sensitized microorganism to a greater extent than said compound inhibits proliferation of a 
nonsensitized microorganism. 

Another embodiment of the present invention is a compound identified using the methods 
20 of the preceding paragraph. 

Another embodiment of the present invention is a method for identifying a compound 
having activity against a biological pathway required for proliferation comprising: 

(a) sensitizing a cell by expressing a sublethal level of an antisense nucleic acid 
complementary to a nucleic acid encoding a gene product required for proliferation, wherein the 

25 activity or expression of said gene product is inhibited by an antisense nucleic acid comprising a 
sequence selected from the group consisting of SEQ ID NOs.: 1-127, in said cell to reduce the 
activity or amount of said gene product; 

(b) contacting the sensitized cell with a compound; and 

(c) determining whether said compound inhibits the growth of said sensitized cell. 

30 The determining step may comprise determining whether said compound inhibits the growth of said 
sensitized cell to a greater extent than said compound inhibits the growth of a nonsensitized cell. 
The cell may be selected from the group consisting of bacterial cells, fungal cells, plant cells, and 
animal cells. The cell may be a Gram negative bacterium. The Gram negative bacterium may be £ 
colL The cell may be selected from the group consisting of Aspergillus jumigatus, Bacillus 

35 anthracis, Campylobacter jejuni, Candida albicans, Chlamydia pneumoniae, Chlamydia 
trachomatus, Clostridium botulinum, Cryptococcus neoformans, Enterobacter cloacae, 
Enterococcus faecalis, Haemophilus influenzae, Helicobacter pylori, Klebsiella pneumoniae, 
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Mycobacterium leprae, Mycobacterium tuberculosis, Neisseria gonorrhoeae, Pseudomonas 
aeruginosa* Saccharomyces cerevisae, Salmonella cholerasuis, Salmonella paratyphi, Salmonella 
typhi, Salmonella typhimurium, Staphylococcus aureus, Staphylococcus epidermidis, Streptococcus 
pneumoniae, Treponema pallidum, and Yersinia pestis or any species falling within the genera of 
5 any of the above species. The antisense nucleic acid may be transcribed from an inducible 
promoter. The method may further comprise contacting the cell with an agent which induces 
expression of said antisense nucleic acid from said inducible promoter, wherein said antisense 
nucleic acid is expressed at a sublethal level. The inhibition of proliferation may be measured by 
monitoring the optical density of a liquid culture. The gene product may comprise a polypeptide 
10 comprising a sequence selected from the group consisting of SEQ ID NOs.: 299-469. 

Another embodiment of the present invention is a compound identified using the methods 
of the preceding paragraph. 

Another embodiment of the present invention is a method for identifying a compound 
having the ability to inhibit cellular proliferation comprising: 
1 5 (a) contacting a cell with an agent which reduces the activity or level of a gene product 

required for proliferation of said cell, wherein said gene product is a gene product whose activity or 
expression is inhibited by an antisense nucleic acid comprising a sequence selected from the group 
consisting of SEQ ID NOs.: 1-127; 

(b) contacting said cell with a compound; and 
20 (c) determining whether said compound reduces proliferation of said contacted cell. 

The determining step may comprise determining whether said compound reduces proliferation of 
said contacted cell to a greater extent than said compound reduces proliferation of cells which have 
not been contacted with said agent. The agent which reduces the activity or level of a gene product 
required for proliferation of said cell may comprise an antisense nucleic acid to a gene or operon 
25 required for proliferation. The agent which reduces the activity or level of a gene product required 
for proliferation of said cell may comprise a compound known to inhibit growth or proliferation of 
a microorganism. The cell may contain a mutation which reduces the activity or level of said gene 
product required for proliferation of said cell. The mutation may be a temperature sensitive 
mutation. The gene product may comprise a polypeptide comprising a sequence selected from the 
30 group consisting of SEQ ID NOs.: 299-469. 

Another embodiment of the present invention is a compound identified using the method of 
the preceding paragraph. 

Another embodiment of the present invention is a method for identifying the biological 
pathway in which a proliferation-required gene or its gene product lies, wherein said gene or gene 
35 product comprises a gene or gene product whose activity or expression is inhibited by an antisense 
nucleic acid comprising a sequence selected from the group consisting of SEQ ID NOs.: 1-127, said 
method comprising: 
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(a) expressing a sublethal level of an antisense nucleic acid which inhibits the activity 
of said proliferation-required gene or gene product in a cell; 

(b) contacting said cell with a compound known to inhibit growth or proliferation of a 
microorganism, wherein the biological pathway on which said compound acts is known; and 

5 (c) determining whether said cell is sensitive to said compound. 

The determining step may comprise determining whether said cell has a substantially greater 
sensitivity to said compound than a cell which does not express said sublethal level of said 
antisense nucleic acid and wherein said gene or gene product lies in the same pathway on which 
said compound acts if said cell expressing said sublethal level of said antisense nucleic acid has a 
10 substantially greater sensitivity to said compound than said cell which does not express said 
sublethal level of said antisense nucleic acid. 

The gene product may comprise a polypeptide comprising a sequence selected from the 
group consisting of SEQ ID NOs.: 299-469. 

Another embodiment of the present invention is a method for determining the biological 
15 pathway on which a test compound acts comprising: 

(a) expressing a sublethal level of an antisense nucleic acid complementary to a 
proliferation-required nucleic acid in a cell, wherein the activity or expression of said proliferation- 
required nucleic acid is inhibited by an antisense nucleic acid comprising a sequence selected from 
the group consisting of SEQ ID NOs.: 1-127 and wherein the biological pathway in which said 

20 proliferation-required nucleic acid or a protein encoded by said proliferation-required polypeptide 
lies is known, 

(b) contacting said cell with said test compound; and 

(c) determining whether said cell is sensitive to said test compound. 

The determining step may comprise determining whether said cell has a substantially greater 
25 sensitivity to said test compound than a cell which does not express said sublethal level of said 
antisense nucleic acid. The method may further comprise: 

(d) expressing a sublethal level of a second antisense nucleic acid complementary to a 
second proliferation-required nucleic acid in a second cell, wherein said second proliferation- 
required nucleic acid is in a different biological pathway than said proliferation-required nucleic 

30 acid in step (a); and 

(e) determining whether said second cell does not have a substantially greater 
sensitivity to said test compound than a cell which does not express said sublethal level of said 
second antisense nucleic acid, wherein said test compound is specific for the biological pathway 
against which the antisense nucleic acid of step (a) acts if said second cell does not have 

35 substantially greater sensitivity to said test compound. 

Another embodiment of the present invention is a purified or isolated nucleic acid 
comprising a sequence selected from the group consisting of SEQ ID NOs.: 1-127. 
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Another embodiment of the present invention is a compound which interacts with a gene or 
gene product whose activity or expression is inhibited by an antisense nucleic acid comprising one 
of SEQ ID NOs.: 1-127 to inhibit proliferation. 

Another embodiment of the present invention is a compound which interacts with a 
5 polypeptide whose expression is inhibited by an antisense nucleic acid comprising one of SEQ ID 
NOs.: 1-127 to inhibit proliferation. 

Another embodiment of the present invention is a method for manufacturing an antibiotic 
comprising the steps of screening one or more candidate compounds to identify a compound that 
reduces the activity or level of a gene product required for proliferation, said gene product 
10 comprising a gene product whose activity or expression is inhibited by an antisense nucleic acid 
comprising a sequence selected from the group consisting of SEQ ID NOs.: 1-127 and 
manufacturing the compound so identified. 

The screening step may comprise performing any one of the methods of identifying a 
compound described above. 
15 Another embodiment of the present invention is a method for inhibiting proliferation of a 

microorganism in a subject comprising administering a compound that reduces the activity or level 
of a gene product required for proliferation of said microorganism, said gene product comprising a 
gene product whose activity or expression is inhibited by an antisense nucleic acid comprising a 
sequence selected from the group consisting of SEQ ID NOs.: 1-127 to said subject. The method of 
20 subject may be selected from the group consisting of vertebrates, mammals, avians, and human 
beings. The gene product may comprise a polypeptide comprising a sequence selected from the 
group consisting of SEQ ID NOs.: 299-469. 

BRIEF DESCRIPTION OF THE DRAWINGS 
Figure 1 is an IPTG dose response curve in £. coli transformed with an IPTG-inducible 
25 plasmid containing either an antisense clone to the E. coli ribosomal protein rpIW (AS-rplW) which 
is required for protein synthesis and essential for cell proliferation, or an antisense clone to the elaD 
gene (AS-elaD) which is not known to be involved in protein synthesis and which is also essential 
for proliferation. 

Figure 2A is a tetracycline dose response curve in E. coli transformed with an IPTG- 
30 inducible plasmid containing antisense to rpIW gene (AS-rpIW) carried out in the presence of 0, 20 
or 50 uM IPTG. 

Figure 2B is a tetracycline dose response curve in E. coli transformed with an IPTG- 
inducible plasmid containing antisense to elaD gene (AS-elaD) carried out in the presence of 0, 20 
or 50 uM IPTG. 

35 Figure 3 is a graph showing die fold increase in tetracycline sensitivity of E. coli 

transfected with antisense clones to essential ribosomal protein genes L23 (AS-rpIW) and L7/L12 
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and L10 (AS-rplLrplJ). Antisense clones to genes known not to be involved in protein synthesis 
(atpB/E(AS-atpB/E), visC (AS-visC, elaD (AS-elaD), yohH (AS-yohH) are much less sensitive to 
tetracycline. 

5 

Definitions 

By "biological pathway" is meant any discrete cell function or process that is carried out by 
a gene product or a subset of gene products. Biological pathways include enzymatic, biochemical 
and metabolic pathways as well as pathways involved in the production of cellular structures such 
10 as cell walls. Biological pathways that are usually required for proliferation of microorganisms 
include, but are not limited to, cell division, DNA synthesis and replication, RNA synthesis 
(transcription), protein synthesis (translation), protein processing, protein transport, fatty acid 
biosynthesis, cell wall synthesis, cell membrane production, synthesis and maintenance, and the 
like. 

15 By "inhibit activity of a gene or gene product" is meant having the ability to interfere with 

the function of a gene or gene product in such a way as to decrease expression of the gene or to 
reduce the level or activity of a product of the gene. Agents which inhibit the activity of a gene 
include agents that inhibit transcription of the gene, agents that inhibit processing of the transcript 
of the gene, agents that reduce the stability of the transcript of the gene, and agents that inhibit 

20 translation of the mRNA transcribed from the gene. In microorganisms, agents which inhibit the 
activity of a gene can act to decrease expression of the operon in which the gene resides or alter the 
folding or processing of operon RNA so as to reduce the level or activity of the gene product. The 
gene product can be a non-translated RNA such as ribosomal RNA, a translated RNA (mRNA) or 
the protein product resulting from translation of the gene mRNA. Of particular utility to the present 

25 invention are antisense RNAs that have activities against the operons or genes to which they 
specifically hybridze. 

By "activity against a gene product" is meant having the ability to inhibit the function or to 
reduce the level or activity of the gene product in a cell. 

By "activity against a protein" is meant having the ability to inhibit the function or to 
30 reduce the level or activity of the protein in a cell. 

By "activity against a nucleic acid" is meant having the ability to inhibit the function or to 
reduce the level or activity of the nucleic acid in a cell. 

By "activity against a gene" is meant having the ability to inhibit the function or expression 
of the gene in a cell. 

35 By "activity against an operon" is meant having the ability to inhibit the function or reduce 

the level of one or more products of the operon in a cell. 

By "antibiotic" is meant an agent which inhibits the proliferation of a microorganism. 
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By "identifying a compound" is meant to screen one or more compounds in a collection of 
compounds such as a combinatorial chemical library or other library of chemical compounds or to 
characterize a single compound by testing the compound in a given assay and determining whether 
it exhibits the desired activity. 
5 By "inducer" is meant an agent or solution which, when placed in contact with a 

microorganism, increases transcription from a desired promoter. 

As used herein, "nucleic acid" means DNA or RNA. Thus, the terminology "the nucleic 
acid of SEQ ID NO: X" includes both the DNA sequence of SEQ ID NO: X and an RNA sequence 
in which the thymidines in the DNA sequence have been substituted with uridines in the RNA 

10 sequence and in which the deoxyribose backbone of the DNA sequence has been substituted with a 
ribose backbone in the RNA sequence. 

As used herein, "sublethal" means a concentration of an agent below the concentration 
required to inhibit all cell growth. 

DETAILED DESCRIPTION OF THE INVENTION 

15 The present invention describes a group of K coli genes and gene families required for growth 

and/or proliferation. A proliferation- required gene or gene family is one where, in the absence of a 
gene transcript and/or gene product, growth or viability of the microorganism is reduced or eliminated. 
Thus, as used herein the terminology "proliferation-required" or "required for proliferation" 
encompasses sequences where the absence of a gene transcript and/or gene product completely 

20 eliminates cell growth as well as sequences where the absence of a gene transcript and/or gene product 
merely reduces cell growth. These proliferation-required genes can be used as potential targets for the 
generation of new antimicrobial agents. To achieve that goal, the present invention also encompasses 
novel assays for analyzing proliferation-required genes and for identifying compounds which interact 
with the gene products of the proliferation-required genes. In addition, the present invention 

25 contemplates the expression of genes and the purification of the proteins encoded by the nucleic acid 
sequences identified as required proliferation genes and reported herein. The purified proteins can be 
used to generate reagents and screen small molecule libraries or other candidate compound libraries for 
compounds that can be further developed to yield novel antimicrobial compounds. The present 
invention also describes methods for identification of homologous genes in organisms other than E. 

30 colL 

The present invention utilizes a novel method to identify proliferation-required E. coli 
sequences. Generally, a library of nucleic acid sequences from a given source are subcloned or 
otherwise inserted into an inducible expression vector, thus forming an expression library. Although 
the insert nucleic acids may be derived from the chromosome of the organism into which the 
35 expression vector is to be introduced, because the insert is not in its natural chromosomal location, the 
insert nucleic acid is an exogenous nucleic acid for the purposes of the discussion herein. The term 
expression is defined as the production of an RNA molecule from a gene, gene fragment, genomic 
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fragment, or operon. Expression can also be used to refer to the process of peptide or polypeptide 
synthesis. An expression vector is defined as a vehicle by which a ribonucleic acid (RNA) sequence is 
transcribed from a nucleic acid sequence earned within the expression vehicle. The expression vector 
can also contain features that permit translation of a protein product from the transcribed RNA message 
5 expressed from the exogenous nucleic acid sequence carried by the expression vector. Accordingly, an 
expression vector can produce an RNA molecule as its sole product or the expression vector can 
produce a RNA molecule that is ultimately translated into a protein product 

Once generated, the expression library containing the exogenous nucleic acid sequences is 
introduced into an K coli population to search for genes that are required for bacterial proliferation. 

10 Because the library molecules are foreign to the population of K coli, the expression vectors and the 
nucleic acid segments contained therein are considered exogenous nucleic acid. 

Expression of the exogenous nucleic acid fragments in the test population of K coli containing 
the expression vector library is then activated. Activation of the expression vectors consists of 
subjecting the cells containing the vectors to conditions that result in the expression of the exogenous 

15 nucleic acid sequences carried by the expression vector library. The test, population of E. coli cells is 
then assayed to determine the effect of expressing the exogenous nucleic acid fragments on the test 
population of cells. Those expression vectors that, upon activation and expression, negatively impact 
the growth of the K coli screen population were identified, isolated, and purified for further study. 

A variety of assays are contemplated to identify nucleic acid sequences that negatively impact 

20 growth upon expression. In one embodiment, growth in K coli cultures expressing exogenous nucleic 
acid sequences and growth in cultures not expressing these sequences is compared. Growth 
measurements are assayed by examining the extent of growth by measuring optical densities. 
Alternatively, enzymatic assays can be used to measure bacterial growth rates to identify exogenous 
nucleic acid sequences of interest. Colony size, colony morphology, and cell morphology are 

25 additional factors used to evaluate growth of the host cells. Those cultures that failed to grow or grow 
with reduced efficiency under expression conditions are identified as containing an expression vector 
encoding a nucleic acid fragment that negatively affects a proliferation-required gene. 

Once exogenous nucleic acid sequences of interest are identified, they are analyzed. The first 
step of the analysis is to acquire the nucleic acid sequence of the nucleic acid fragment of interest To 

30 achieve this end, the insert in those expression vectors identified as containing a sequence of interest is 
sequenced, using standard techniques well known in the art. The next step of the process is to 
determine the source of the nucleic acid sequence. 

Determination of sequence source is achieved by comparing the obtained sequence data with 
known sequences in various genetic databases. The sequences identified are used to probe these gene 

35 databases. The result of this procedure is a list of exogenous nucleic acid sequences corresponding to a 
list that includeds novel bacterial genes required for proliferation as well as genes previously identified 
as required for proliferation. 
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The number of DNA and protein sequences available in database systems has been growing 
exponentially for years. For example, at the end of 1998, the complete sequences of Caenorhabditis 
elegans, Saccharomyces cerevisiae and nineteen bacterial genomes, including K coti were available. 
This sequence information is stored in a number of databanks, such as GenBank (the National Center 
5 for Biotechnology Information (NCBI), and is publicly available for searching. 

A variety of computer programs are available to assist in the analysis of the sequences stored 
within these databases. FastA, (W. R. Pearson (1990) "Rapid and Sensitive Sequence Comparison 
with FASTP and FASTA" Methods in Enzymology 183:63- 98), Sequence Retrieval System (SRS), 
(Etzold & Argos, SRS an indexing and retrieval tool for flat file data libraries. Comput. Appl. 

10 Biosci. 9:49-57, 1993) are two examples of computer programs that can be used to analyze 
sequences of interest. In one embodiment of the present invention, the BLAST family of computer 
programs, which includes BLASTN version 2.0 with the default parameters, or BLASTX version 
2.0 with the default parameters, is used to analyze nucleic acid sequences. 

BLAST, an acronym for "Basic Local Alignment Search Tool," is a family of programs for 

15 database similarity searching. The BLAST family of programs includes: BLASTN, a nucleotide 
sequence database searching program, BLASTX, a protein database searching program where the input 
is a nucleic acid sequence; and BLASTP, a protein database searching program. BLAST programs 
embody a fast algorithm for sequence matching, rigorous statistical methods for judging the 
significance of matches, and various options for tailoring the program for special situations. Assistance 

20 in using the program can be obtained by e-mail at blast@ncbi.nlm.nih.gov. 

Bacterial genes are often transcribed in polycistronic groups. These groups comprise operons, 
which are a collection of genes and intergenic sequences. The genes of an operon are co-transcribed 
and are often related functionally. Given the nature of the screening protocol, it is possible that the 
identified exogenous nucleic acid sequence corresponds to a gene or portion thereof with or without 

25 adjacent noncoding sequences, an intragenic sequence (i.e. a sequence within a gene), an intergenic 
sequence (i.e. a sequence between genes), a sequence spanning at least a portion of two or more genes, 
a 5' noncoding region or a 3' noncoding region located upstream or downstream from the actual 
sequence that is required for bacterial proliferation. Accordingly, determining which of the genes that 
are encoded within the operons are individually required for proliferation is often desirable. 

30 In one embodiment of the present invention, an operon is dissected to determine which gene or 

genes are required for proliferation. For example, the RegulonDB DataBase described by Huerta et al. 
(Nucl. Acids Res. 26:55-59, 1998), which may also be found on the website 
http^/www.cifh.unam.mx/Computational_Biology/regulondb/, may be used, to identify the boundaries 
of operons encoded within microbial genomes. A number of techniques that are well known in the art 

35 can be used to dissect the operon. In one aspect of this embodiment, gene disruption by homologous 
recombination is used to individually inactivate the genes of an operon that is thought to contain a gene 
required for proliferation. 
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Several gene disruption techniques have been described for the replacement of a functional 
gene with a mutated, non-functional (null) allele. These techniques generally involve the use of 
homologous recombination. The method described by Link et al. (J. Bacterid 1997 179:6228) 
serves as an excellent example of these methods as applicable to disruption of genes in E. colL This 
5 technique uses crossover PCR to create a null allele with an in-frame deletion of the coding region 
of a target gene. The null allele is constructed in such a way that sequences adjacent to the wild 
type gene (ca. 500 bp) are retained. These homologous sequences surrounding the deletion null 
allele provide targets for homologous recombination so that the wild type gene on the K coli 
chromosome can be replaced by the constructed null allele. 

10 The crossover PCR amplification product is subcloned into the vector pK03, the features of 

which include a chloramphenicol resistance gene, the counter-selectable marker sacB, and a 
temperature sensitive autonomous replication function. Following transformation of an E. coli cell 
population with such a vector, selection for cells that have undergone homologous recombination of 
the vector into the chromosome is achieved by growth on chloramphenicol at the non-permissive 

15 temperature of 43°C. Under these conditions, autonomous replication of the plasmid cannot occur 
and cell are resistant to chloramphinicol only if the chloramphenicol resistance gene has been 
integrated into the chromosome. Usually a single crossover event is responsible for this integration 
event such that the £ coli chromosome now contains a tandem duplication of the target gene 
consisting of one wild type allele and one deletion null allele separated by vector sequence. 

20 This new £. coli strain containing the tandem duplication can be maintained at permissive 

temperatures in the presence of drug selection (chloramphenicol). Subsequently, cells of this new 
strain are cultured at the permissive temperature 30°C without drug selection. Under these 
conditions, the chromosome of some of the cells within the population will have undergone an 
internal homologous recombination event resulting in removal of the plasmid sequences. 

25 Subsequent culturing of the strain in growth medium lacking chloramphenicol but containing 
sucrose is used to select for such recombinative resolutions. In the presence of the counter- 
selectable marker sacB, sucrose is rendered into a toxic metabolite. Thus, cells that survive this 
counter-selection have lost both the plasmid sequences from the chromosome and the autonomously 
replicating plasmid that results as a byproduct of recombinative resolution. 

30 There are two possible outcomes of the above recombinative resolution via homologous 

recombination. Either the wild type copy of the targeted gene is retained on the chromosome or the 
mutated null allele is retained on the chromosome. In the case of an essential gene, a single copy of 
the null allele would be lethal and such cells should not be obtained by the above procedure when 
applied to essential genes. In the case of a non-essential gene, roughly equal numbers of cells 

35 containing null alleles and cells containing wild type alleles should be obtained. Thus, the method 
serves as a test for essentiality of the targeted gene: when applied to essential genes, only cells with 
a wild type allele on the chromosome will be obtained. 
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Other techniques have also been described for the creation of disruption mutations in K 
coli. For example, Link et al. also describe inserting an in-frame sequence tag concommitantly with 
an in-frame deletion in order to simplify analysis of recombinants obtained. Further, Link et al. 
describe disruption of genes with a drug resistance marker such as a kanamycin resistance gene. 
5 Arigoni et al., (Arigoni, F. et al. A Genome-based Approach for the Identification of Essential 
Bacterial Genes, Nature Biotechnology 16: 851-856) describe the use of gene disruption combined 
with engineering a second copy of a test gene such that the expression of the gene is regulated by 
and inducible promoter such as the arabinose promoter to test the essentiality of the gene. Many of 
these techniques result in the insertion of large fragments of DNA into the gene of interest, such as 

10 a drug selection marker. An advantage of the technique described by Link et al. is that it does not 
rely on an insertion into the gene to cause a functional defect, but rather results in the precise 
removal of the coding region. This insures the lack of polar effects on the expression of genes 
downstream from the target gene. 

Recombinant DNA techniques can be used to express the entire coding sequences of the gene 

15 identified as required for proliferation, or portions thereof. The over-expressed proteins can be used as 
reagents for further study. The identified exogenous sequences are isolated, purified, and cloned into a 
suitable expression vector using methods well known in the art. If desired, the nucleic acids can 
contain the sequences encoding a signal peptide to facilitate secretion of the expressed protein. 

Expression of fragments of the bacterial genes identified as required for proliferation is also 

20 contemplated by the present invention. The fragments of the identified genes can encode a polypeptide 
comprising at least 5, at least 10, at least 15, at least 20, at least 25, at least 30, at least 35, at least 40, at 
least 45, at least 50, at least 55, at least 60, at least 65, at least 75, or more than 75 consecutive amino 
acids of a gene complementary to one of the identified sequences of the present invention. The nucleic 
acids inserted into the expression vectors can also contain sequences upstream and downstream of the 

25 coding sequence. 

When expressing the coding sequence of an entire gene identified as required for bacterial 
proliferation or a fragment thereof, the nucleic acid sequence to be expressed is operably linked to a 
promoter in an expression vector using conventional cloning technology. The expression vector can be 
any of the bacterial, insect, yeast, or mammalian expression systems known in the art Commercially 

30 available vectors and expression systems are available from a variety of suppliers including Genetics 
Institute (Cambridge, MA), Stratagene (La Jolla, California), Promega (Madison, Wisconsin), and 
Invhrogen (San Diego, California). If desired, to enhance expression and facilitate proper protein 
folding, the codon usage and codon bias of the sequence can be optimized for the particular expression 
organism in which the expression vector is introduced, as explained by Hatfield, et al., U.S. Patent No. 

35 5,082,767. Fusion protein expression systems are also contemplated by the present invention. 

Following expression of the protein encoded by the identified exogenous nucleic acid 
sequence, the protein is purified. Protein purification techniques are well known in the art. Proteins 
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encoded and expressed from identified exogenous nucleic acid sequences can be partially purified 
using precipitation techniques, such as precipitation with polyethylene glycol. Chromatographic 
methods usable with the present invention can include ion-exchange chromatography, gel filtration, use 
of hydroxyapaptite columns, immobilized reactive dyes, chromatofocusing, and use of high- 
5 performance liquid chromatography. Electrophoretic methods such one-dimensional gel 
electrophoresis, high-resolution two-dimensional polyacrylamide electrophoresis, isoelectric focusing, 
and others are contemplated as purification methods. Also, affinity chromatographic methods, 
comprising antibody columns, ligand presenting columns and other affinity chromatographic matrices 
are contemplated as purification methods in the present invention. 

10 The purified proteins produced from the gene coding sequences identified as required for 

proliferation can be used in a variety of protocols to generate useful antimicrobial reagents. In one 
embodiment of the present invention, antibodies are generated against the proteins expressed from the 
identified exogenous nucleic acid sequences. Both monoclonal and polyclonal antibodies can be 
generated against the expressed . proteins. Methods for generating monoclonal and polyclonal 

15 antibodies are well known in the art Also, antibody fragment preparations prepared from the produced 
antibodies discussed above are contemplated. 

Another application for the purified proteins of the present invention is to screen small 
molecule libraries for candidate compounds active against the various target proteins of the present 
invention. Advances in the field of combinatorial chemistry provide methods, well known in the art, to 

20 produce large numbers of candidate compounds that can have a binding, or otherwise inhibitory effect 
on a target protein. Accordingly, the screening of small molecule libraries for compounds with binding 
affinity or inhibitory activity for a target protein produced from an identified gene sequence is 
contemplated by the present invention. 

The present invention further contemplates utility against a variety of other pathogenic 

25 organisms in addition to E. colL For example, the invention has utility in identifying genes required for 
proliferation in prokaryotes and eukaryotes. For example, the invention has utility with protists, such 
as Plasmodium spp.; plants; animals, such as Entamoeba spp. and Contracaecum spp; and fungi 
including Candida spp., (e.g., Candida albicans), Saccharomyces cerevisiae, Cryptococcus 
neoformans, and Aspergillus Jumigatus. In one embodiment of the present invention, monera, 

30 specifically bacteria are probed in search of novel gene sequences required for proliferation. This 
embodiment is particularly important given the rise of drug resistant bacteria. 

The numbers of bacterial species that are becoming resistant to existing antibiotics are 
growing. A partial list of these organisms includes: Staphylococcus spp., such as S. aureus; 
Enterococcus spp., such as E.faecalis; Pseudomonas spp., such as P. aeruginosa, Clostridium spp., 

35 such as C. botulinum, Haemophilus sppt, such as H. influenzae, Enterobacter spp., such as E. 
cloacae, Vibrio spp., such as V. cholera; Moraxala spp., such as M catarrhalis; Streptococcus spp., 
such as S. pneumoniae, Neisseria spp., such as N. gonorrhoeae; Mycoplasma spp., such as 
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Mycoplasma pneumoniae; Salmonella typhimurium; Helicobacter pylori, Escherichia coli; and 
Mycobacterium tuberculosis. The sequences identified as required for proliferation in the present 
invention can be used to probe these and other organisms to identify homologous required 
proliferation genes contained therein. 
5 In one embodiment of the present invention, the nucleic acid sequences disclosed herein are 

used to screen genomic libraries generated from bacterial species of interest other than E coli. For 
example, the genomic library may be from Staphylococcus aureus, Pseudomonas aeruginosa, 
Enterobacter cloacae, Helicobacter pylori, Neisseria gonorrhoeae, Enterococcus faecalis, 
Streptococcus pneumoniae, Haemophilus influenzae, Salmonella typhimurium, Saccharomyces 

10 cerevisiae, Candida albicans, Cryptococcus neoformans, Aspergillus fumigatus, Klebsiella 
pneumoniae, Salmonella typhi, Salmonella paratyphi, Salmonella cholerasuis, Staphylococcus 
epidermidis, Mycobacterium tuberculosis, Mycobacterium leprae, Treponema pallidum, Bacillus 
anthracis, Yersinia pestis, Clostridium botulinum, Campylobacter jejuni, Chlamydia trachomatus, 
Chlamydia pneumoniae or any species falling within the genera of any of the above species. 

15 Standard molecular biology techniques are used to generate genomic libraries from various 
microorganisms. In one aspect, the libraries are generated and bound to nitrocellulose paper. The 
identified exogenous nucleic acid sequences of the present invention can then be used as probes to 
screen the libraries for homologous sequences. The homologous sequences identified can then be used 
as targets for the identification of new, antimicrobial compounds with activity against more than one 

20 organism. 

For example, the preceding methods may be used to isolate nucleic acids having a sequence 
with at least 97%, at least 95%, at least 90%, at least 85%, at least 80%, or at least 70% identity to a 
nucleic acid sequence selected from the group consisting of one of the sequences of SEQ ID NOS. 
1-127, 128-298, fragments comprising at least 10, 15, 20, 25, 30, 35, 40, 50, 75, 100, 150, 200, 300, 

25 400, or 500 consecutive bases thereof, and the sequences complementary thereto. Identity may be 
measured using BLASTN version 2.0 with the default parameters. (Altschul, S.F. et al. Gapped 
BLAST and PSI-BLAST: A New Generation of Protein Database Search Programs, Nucleic Acid 
Res. 25: 3389-3402 (1997)). For example, the homologous polynucleotides may have a coding 
sequence which is a naturally occurring allelic variant of one of the coding sequences described 

30 herein. Such allelic variants may have a substitution, deletion or addition of one or more 
nucleotides when compared to the nucleic acids of SEQ ID NOs: 1-127, 128-298 or the sequences 
complementary thereto. 

Additionally, the above procedures may be used to isolate nucleic acids which encode 
polypeptides having at least 99%, 95%, at least 90%, at least 85%, at least 80%, at least 70%, at 

35 least 60%, at least 50%, or at least 40% identity or similarity to a polypeptide having the sequence 
of one of SEQ ID NOs: 299-469or fragments comprising at least 5, 10, 15, 20, 25, 30, 35, 40, 50, 75, 
100, or 150 consecutive amino acids thereof as determined using the FASTA version 3.0t78 
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algorithm with the defauh parameters. Alternatively, protein identity or similarity may be 
identified using BLASTP with the default parameters, BLASTX with the defeult parameters, or 
TBLASTN with the default parameters. (Alschul, S.F. et al. Gapped BLAST and PS1-BLAST: A 
New Generation of Protein Database Search Programs, Nucleic Acid Res. 25: 3389-3402 (1997)). 
5 Alternatively, homologous nucleic acids or polypeptides may be identified by searching a 

database to identify sequences having a desired level of homology to a nucleic acid or polypeptide 
involved in proliferation or an antisense nucleic acid to a nucleic acid involved in microbial 
proliferation. A variety of such databases are available to those skilled in the art, including 
GenBank and GenSeq. In some embodiments, the databases are screened to identify nucleic acids 

10 or polypeptides having at least 97%, at least 95%, at least 90%, at least 85%, at least 80%, at least 
70%, at least 60%, or at least 50%, at least 40% identity or similarity to a nucleic acid or 
polypeptide involved in proliferation or an antisense nucleic acid involved in proliferation. For 
example, the database may be screened to identify nucleic acids homologous to one of SEQ ID Nos. 
1-127, 128-298 or polypeptides homologous to SEQ ID NOs. 299-469. In some embodiments, the 

15 database may be screened to identify homologous nucleic acids or polypeptides from organisms 
other than £. coli, including organisms such as Staphylococcus aureus, Pseudomonas aeruginosa, 
Enterobacter cloacae, Helicobacter pylori, Neisseria gonorrhoeae, Enterococcus faecalis, 
Streptococcus pneumoniae, Haemophilus influenzae, Salmonella typhimurium, Saccharomyces 
cerevisiae, Candida albicans, Cryptococcus neoformans, Aspergillus fumigatus, Klebsiella 

20 pneumoniae, Salmonella typhi, Salmonella paratyphi, Salmonella cholerasuis, Staphylococcus 
epidermidis, Mycobacterium tuberculosis, Mycobacterium leprae, Treponema pallidum, Bacillus 
anthracis, Yersinia pest is, Clostridium botulinum, Campylobacter jejuni,Chlamydia trachomatus, 
Chlamydia pneumoniae or any species falling within the genera of any of the above species. 

In another embodiment, gene expression arrays and microarrays can be employed. Gene 

25 expression arrays are high density arrays of DNA samples deposited at specific locations on a glass 
chip, nylon membrane, or the like. Such arrays can be used by researchers to quantify relative gene 
expression under different conditions. Gene expression arrays are used by researchers to help 
identify optimal drug targets, profile new compounds, and determine disease pathways. An 
example of this technology is found in U.S. Patent No. 5807522. 

30 It is possible to study the expression of all genes in the genome of a particular microbial 

organism using a single array. For example, the arrays from Genosys consist of 12 x 24 cm nylon 
filters containing PCR products corresponding to 4290 ORFs from E. coli. 10 ngs of each are 
spotted every 1.5 mm on the filter. Single stranded labeled cDNAs are prepared for hybridization 
to the array (no second strand synthesis or amplification step is done) and placed in contact with the 

35 filter. Thus the labeled cDNAs are of "antisense" orientation. Quantitative analysis is done by 
phosphorimager. 
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Hybridization of cDNA made from a sample of total cell mRNA to such an array followed 
by detection of binding by one or more of various techniques known to those in the art results in a 
signal at each location on the array to which cDNA hybridized. The intensity of the hybridization 
signal obtained at each location in the array thus reflects the amount of mRNA for that specific 
5 gene that was present in the sample. Comparing the results obtained for mRNA isolated from cells 
grown under different conditions thus allows for a comparison of the relative amount of expression 
of each individual gene during growth under the different conditions. 

Gene expression arrays may be used to analyze the total mRNA expression pattern at 
various time points after induction of an antisense nucleic acid against a proliferation-required gene. 

10 Analysis of the expression pattern indicated by hybridization to the array provides information on 
whether or not the target gene of the antisense nucleic acid is being affected by antisense induction, 
how quickly the antisense is affecting the target gene, and for later timepoints, what other genes are 
affected by antisense expression. For example, if the antisense is directed against a gene for 
ribosomal protein L7/L12 in the SOS subunit, its targeted mRNA may disappear first and then other 

IS mRNAs may be observed to increase, decrease or stay the same. Similarly, if the antisense is 
directed against a different SOS subunit ribosomal protein mRNA (e.g. L25), that mRNA may 
disappear first followed by changes in mRNA expression that are similar to those seen with the 
L7/L12 antisense expression. Thus, the mRNA expression pattern observed with an antinsense 
nucleic acid against a proliferation required gene may identify other proliferation-required nucleic 

20 acids in the same pathway as the target of the antisense nucleic acid. In addition, the mRNA 
expression patterns observed with candidate drug compounds may be compared to those observed 
with antisense nucleic acids against a proliferation-required nucleic acid. If the mRNA expression 
pattern observed with the candidate drug compound is similar to that observed with the antisense 
nucleic acid, the drug compound may be a promising therapeutic candidate. Thus, the assay would 

25 be useful in assisting in the selection of candidate drug compounds for use in screening methods 
such as those described below. 

In cases where the source of nucleic acid deposited on the array and the source of the 
nucleic acid being hybridized to the array are from two different organisms, gene expression arrays 
can identify homologous genes in the two organisms. 

30 The present invention also contemplates additional methods for screening other 

microorganisms for proliferation-required genes. In this embodiment, the conserved portions of 
sequences identified as proliferation-required can be used to generate degenerate primers for use in the 
polymerase chain reaction (PCR). The PCR technique is well known in the art The successful 
production of a PCR product using degenerate probes generated from the sequences identified herein 

35 would indicate the presence of a homologous gene sequence in the species being screened. This 
homologous gene is then isolated, expressed, and used as a target for candidate antibiotic compounds. 
In another aspect of this embodiment, the homologous gene is expressed in an autologous organism or 

-23- 



WO 01/34810 



PCT/US00/30950 



in a heterologous organism in such a way as to alter the level or activity of a homologous gene required 
for proliferation in the autologous or heterologus organism. In still another aspect of this embodiment, 
the homologous gene or portion is expressed in an anti sense orientation in such a way as to alter the 
level or activity of a nucleic acid required for proliferation of an autologous or heterologous organism. 
5 The homologous sequences to proliferation-required genes identified using the techniques 

described herein may be used to identify proliferation-required genes of organisms other than E. 
colU to inhibit the proliferation of organisms other than ft. coli by inhibiting the activity or reducing 
the amount of the identified homologous nucleic acid or polypeptide in the organism other than E. 
coli, or to identify compounds which inhibit the growth of organisms other than E. coli as described 
10 below. 

In another embodiment of the present invention, E coli sequences identified as required for 
proliferation are transferred to expression vectors capable of function within non-£ coli species. As 
would be appreciated by one of ordinary skill in the art, expression vectors must contain certain 
elements that are species specific. These elements can include promoter sequences, operator 

15 sequences, repressor genes, origins of replication, ribosomal binding sequences, termination 
sequences, and others. To use the identified exogenous sequences of the present invention, one of 
ordinary skill in the art would know to use standard molecular biology techniques to isolate vectors 
containing the sequences of interest from cultured bacterial cells, isolate and purify those 
sequences, and subclone those sequences into an expression vector adapted for use in the species of 

20 bacteria to be screened. 

Expression vectors for a variety of other species are known in the art. For example, Cao et 
al. report the expression of steroid receptor fragments in Staphylococcus aureus, J. Steroid 
Biochem Mol Biol. 44(1): I- 11 (1993). Also, Pla et al. have reported an expression vector that is 
functional in a number of relevant hosts including: Salmonella typhimurium, Pseudomonas putida, 

25 and Pseudomonas aeruginosa. J. Bactcriol. l72(8):4448-55 (1990). These examples demonstrate 
the existence of molecular biology techniques capable of constructing expression vectors for the 
species of bacteria of interest to the present invention. 

Following the subcloning of the identified nucleic acid sequences into an expression vector 
functional in the microorganism of interest, the identified nucleic acid sequences are conditionally 

30 transcribed to assay for bacterial growth inhibition. Those expression vectors found to contain 
sequences that, when transcribed, inhibit bacterial growth are compared to the known genomic 
sequence of the pathogenic microorganism being screened or, if the homologous sequence from the 
organism being screened is not known, it may be identified and isolated by hybridization to the 
proliferation-required E coli sequence interest or by amplification using primers based on the 

35 proliferation-required £. coli sequence of interest as described above. 

The antisense sequences from the second organism which are identified as described above 
may then be operably linked to a promoter, such as an inducible promoter, and introduced into the 
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second organism. The techniques described herein for identifying R coli genes required for 
proliferation may thus be employed to determine whether the identified sequences from a second 
organism inhibit the proliferation of the second organism. 

Antisense nucleic acids required for the proliferation of organisms other than E. coli or the 
5 genes corresponding thereto, may also be hybridized to a microarray containing the R coli ORFs to 
gauge the homology between the E. coli sequences and the proliferation-required nucleic acids from 
other organisms. For example, the proliferation-required nucleic acid may be from Staphylococcus 
aureus, Pseudomonas aeruginosa, Enterobacter cloacae, Helicobacter pylori, Neisseria 
gonorrhoeae, Enterococcus faecalis, Streptococcus pneumoniae, Haemophilus influenzae, 

10 Salmonella typhimurium, Saccharomyces cerevisiae, Candida albicans, Cryptococcus neoformans 9 
Aspergillus fumigatus y Klebsiella pneumoniae^ Salmonella typhi, Salmonella paratyphi, Salmonella 
cholerasuis, Staphylococcus epidermidis, Mycobacterium tuberculosis, Mycobacterium leprae, 
Treponema pallidum, bacillus anthracis. Yersinia pestis, Clostridium botulinum, Campylobacter 
jejuni or Chlamydia trachomatus, Chlamydia pneumoniae or any species falling within the genera 

15 of any of the above species. The proliferation-required nucleic acids from an organism other than 
E. coli may be hybridized to the array under a variety of conditions which permit hybridization to 
occur when the probe has different levels of homology to the sequence on the microarray. This 
would provide an indication of homology across the organisms as well as clues to other possible 
essential genes in these organisms. 

20 In still another embodiment, the exogenous nucleic acid sequences of the present invention 

that are identified as required for bacterial growth or proliferation can be used as antisense therapeutics 
for killing bacteria. The antisense sequences can be directed against the proliferation-required genes 
whose sequence corresponds to the exogenous nucleic acid probes identified here (i.e. the antisense 
nucleic acid may hybridize to the gene or a portion thereof). Alternatively, antisense therapeutics can 

25 be directed against operons in which proliferation-required genes reside (i.e. the antisense nucleic acid 
may hybridize to any gene in the operon in which the proliferation-required genes reside). Further, 
antisense therapeutics can be directed against a proliferation-required gene or portion thereof with or 
without adjacent noncoding sequences, an intragenic sequence (i.e. a sequence within a gene), an 
intergenic sequence (i.e. a sequence between genes), a sequence spanning at least a portion of two or 

30 more genes, a 5' noncoding region or a 3' noncoding region located upstream or downstream from the 
actual sequence that is required for bacterial proliferation or an operon containing a proliferation- 
required gene. 

In addition to therapeutic applications, the present invention encompasses the use of nucleic 
acid sequences complementary to sequences required for proliferation as diagnostic tools. For 
35 example, nucleic acid probes complementary to proliferation-required sequences that are specific for 
particular species of microorganisms can be used as probes to identify particular microorganism 
species in clinical specimens. This utility provides a rapid and dependable method by which to identify 
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the causative agent or agents of a bacterial infection. This utility would provide clinicians the ability to 
prescribe species specific antimicrobial compounds to treat such infections. In an extension of this 
utility, antibodies generated against proteins translated from mRNA transcribed from proliferation- 
required sequences can also be used to screen for specific microorganisms that produce such proteins in 
5 a species-specific manner. 

The following examples teach the genes of the present invention and a subset of uses for the £ 
coli genes identified as required for proliferation. These examples are illustrative only and are not 
intended to limit the scope of the present invention. 

EXAMPLES 

10 The following examples are directed to the identification and exploitation of K coli genes 

required for proliferation. Methods of gene identification are discussed as well as a variety of methods 
to utilize the identified sequences. 

Genes Identified as Required for Proliferation of & coli 

15 Exogenous nucleic acid sequences were cloned into an inducible expression vector and 

assayed for growth inhibition activity. Example 1 describes the examination of a library of 
exogenous nucleic acid sequences cloned into the IPTG-inducible expression vector pLexSBA 
(Krause et al., J. Mol. Biol. 274: 365 (1997)). Upon activation or induction, the expression vectors 
produced an RNA molecule corresponding to the subcloned exogenous nucleic acid sequences. 

20 The RNA product was in an antisense orientation with respect to the K coli genes from which it 
was originally derived. This antisense RNA then interacted with sense mRNA produced from 
various E. coli genes and interfered with or inhibited the translation of the sense messenger RNA 
(mRNA) thus preventing protein production from these sense mRNA molecules. In cases where 
the sense mRNA encoded a protein required for the proliferation, bacterial cells containing an 

25 activated expression vector failed to grow or grew at a substantially reduced rate. Similar results 
have also be obtained in cases where the gene encodes a non-translated RNA, such as a ribosomal 
RNA. 

EXAMPLE 1 

Inhibition of Bacterial Proliferation after IPTG induction 
30 To study the effects of transcriptional induction in liquid medium, growth curves were carried 

out by back diluting cultures 1:200 into fresh media with or without 1 mM IPTG and measuring the 
OD450 every 30 minutes (min). To study the effects of transcriptional induction on solid medium, 10 2 , 
10 3 , 10 4 , 10 5 , 10 6 , 10 7 and 10 8 fold dilutions of overnight cultures were prepared. Aliquots of from 0.5 
to 3 }xl of these dilutions were spotted on selective agar plates with or without 1 mM IPTG. After 
35 overnight incubation, the plates were compared to assess the sensitivity of the clones to IPTG. 

Of the numerous clones tested, some clones were identified as containing a sequence that 
inhibited K coli growth after IPTG induction. Accordingly, the gene to which the inserted nucleic acid 
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sequence corresponds, or a gene within the operon containing the inserted nucleic acid, may be 
required for proliferation in E colL 

Characterization of Isolated Clones Negatively Affecting K coli Proliferation 

5 Following the identification of those expression vectors that, upon expression, negatively 

impacted E. coli growth or proliferation, the inserts or nucleic acid fragments contained in those 
expression vectors were isolated for subsequent characterization. Inserts in expression vectors of 
interest were subjected to nucleic acid sequence determination. 

EXAMPLE 2 

10 Nucleic Acid Sequence Determination ofldentified Clones Expressing Nucleic Acid Fragments with 

Detrimental Effects of E coli Proliferation 
The nucleotide sequences for the exogenous identified sequences were determined using 
plasmid DNA isolated using QIAPREP (Qiagen, Valencia, CA) and methods supplied by the 
manufacturer. The primers used for sequencing the inserts were 5' - TGTTTATCAGACCGCTT - 3* 
1 5 (SEQ ID NO: 1) and 5* - ACAATTTCACACAGCCTC - 3' (SEQ ID NO: 2). These sequences flank 
the polylinker in pLEXSBA. Sequence identification numbers (SEQ ID NOs) for the identified inserts 
are listed in Table I and discussed below. 

EXAMPLE 3 

Comparison Of Isolated Sequences to Known Sequences 

20 The nucleic acid sequences of the subcloned fragments obtained from the expression vectors 

discussed above were compared to known E. coli sequences in GenBank using BLAST version 1.4 or 
version 2.0.6 using the following default parameters: Filtering off, cost to open a gap=5, cost to extend 
a gapF=2, penalty for a mismatch in the blast portion of run=-3, reward for a match in the blast portion 
of run=l, expectation value (e)=l0.0, word size=l 1, number of one-line descriptions^ 00, number of 

25 alignments to show (B)=100. BLAST is described in Altschul, J Mol Biol. 215:403-10 (1990). 
Expression vectors were found to contain nucleic acid sequences in both the sense and antisense 
orientations. The presence of known genes, open reading frames, and ribosome binding sites was 
determined by comparison to public databases holding genetic information and various computer 
programs such as die Genetics Computer Group programs FRAMES and CODONPREFERENCE. 

30 Clones were designated as "antisense" if the cloned fragment was oriented to the promoter such that 
the RNA transcript produced was complementary to the expressed mRN A from a chromosomal locus. 
Clones were designated as "sense" if they coded for an RNA fragment that was identical to a portion of 
a wild type mRNA from a chromosomal locus. 

The sequences described in Examples 1-2 that inhibited bacterial proliferation and contained 

35 gene fragments in an antisense orientation are listed in Table I. This table lists each identified sequence 
by: a sequence identification number, a Molecule Number, a gene to which the identified sequence 
corresponds, listed according to the National Center for Biotechnology Information (NCBI), Blattner 
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(Science 277:1453-1474(1997); also contains the K coli K-12 genome sequence), or Rudd (Micro, 
and Mol. Rev. 62:985-1019 (1998)), nomenclatures. The CONTIG numbers for each identified 
sequence is shown, as well as the location of the first and last base pairs located on the K coli 
chromosome. A Molecule Number with a indicates a clone corresponding to an intergenic 
sequence. 



10 



TABLE I 



Identified Clones with Corresponding Genes and Operons 



Clone Name 


Seq 


Molecule 


- Gene 


Gene 


Gene 


Contig 


Start 


Stop . 




ID 


No. 


(NCBI) 


(Blat- 
tner) 


(Rudd) 








626.024 


1 


EcXA056 


f320 


blU3 


ycfS 


AE000211 


7631 


7971 


E1M10000I16B1 


2 


EcXA056b 


ycfS 


bl 1 13 


ycfS 


AE000211 


7658 


7847 


E1M10000155F12 


3 


EcXA056c 


ycfS 


blll3 


ycfS 


AE000211 


7649 


8037 




A 




3Tp 


hdfH 7 
UH\J I / 


arp 








EIM10000I44B6 


5 


EcXA057b 


arp 


b4017 


arp 


AE000474 


14187 


14385 


Z60-P16 


6 


EcXA058 


rp!C 


b3320 


rplC 


AE000408 


10002 


10338 


Z80-D10 


7 


EcXA059 


ypjA 


b2647 


ypjA 


AE000349 
AE0OO35O 


10402 
1 


10493 
728 


P33-1.C22 


8 


EcXA060 


rplR 


b3304 


rplR 


AE000408 


2763 


2958 


E1M10000I61C06 


9 


EcXA060b 


RplR; 
rplF 


b3304; 
b3305 


RplR; 
rplF 


AEOOO408 


3006 


3477 


P35-7 


10 


EcXA06t 


malE 


b4034 


malE 


AE000476 


11925 


12089 


P35-8 




EcXA062 


rep 


b3778 


rep 


AE000454 


4438 


4111 


P38-1.G20 


12 


EcXA063 


elaD 


b2269 


elaD 


AE000316 


9912 


9581 


E1M100G0107H4 


13 


EcXA063b 


elaD 


b2269 


elaD 


AE000316 


9520 


9389 


E1M10000122B03 


14 


EcXA063c 


elaD 


b2269 


elaD 


AE000316 


9979 


9715 


E1M1O0OO139BO7 


15 


EcXA063d 


elaD 


b2269 


elaD 


AE000316 


10171 


9533 


E1MI0000152G3 


16 


EcXA063e 


elaD 


b2269 


elaD 


AE000316 


9535 


9406 


E1MI0000143G03 


17 


EcXA063f 


elaD 


b2269 


elaD 


AE000316 


10104 


9869 


E1MI0000131H01 


18 


EcXA063h 


elaD 


b2269 


elaD 


AE000316 


9953 


9746 


P3 19-4.06 


19 


EcXA064 


CyoE 


b0428 


cyoE 


AE000149 


2140 


2293 


P323-1.MIO 


20 


EcXA065 


DgoA 


b3692 


YidU 


AE000446 


6005 


6272 


E1MI00001I1E4 


21 


EcXA065b 


DgoA 


b3692 


YidU 


AE000446 


6005 


6133 


P323-8.P1 


22 


EcXA066 


Rpml 


b!717 


Rpml 


AE000266 


10240 


10390 



-28r 



WO 01/34810 



PCT/US00/30950 



Clone Name 


Seq 


Molecule 


Gene 


Gene 


Gene 


Contig 


Start 


Stop 




ID 


No. 


(NCBI) 


(Blat- 


(Rudd) 
















tner) 










E1MI0000137G09 


23 


EcXA066b 


RplT;r 


bl716; 


RplT; 


AE000266 


9947 


10525 








pml 


bl717 


Rpml 








P326-22.E17 


24 


EcXA067 


xylF 


b3566 


XylF 


AE000434 


288 


95 


P326-9.K2 


25 


EcXA068 


YhfL; 


b3369; 


yhfL; 


AE00O413 


581 


306 








yhfM 


b3370 


yhfM 








P327-50.M10 


26 


EcXA069 


RplD; 


b33!9; 


rplD; 


AEO0O408 


9747 


9900 








rpIC 


b3320 


rpIC 








E1M10000110G1 


27 


EcXA069b 


RplD; 


b33l9; 


rplD; 


AE000408 


9789 


9933 








rpIC 


b3320 


rpIC 








E1M10000121D08 


28 


EcXA069c 


RplD; 


b3319; 


RplD; 


AE000408 


9737 


10002 








rpIC 


b3320 


rpIC 








E1M10000I36HI 


29 


EcXA069d 


RplD; 


b3319; 


RplD; 


AE000408 


9707 


10241 








rpIC 


b3320 


rpIC 








E1M10000126E08 


30 


EcXA069c 


rpIC 


b3320 


RpIC 


AEO0O408 


10157 


10379 


El M 100001 37C04 


31 


EcXA069f 


RplD; 


b3319; 


RplD; 


AEO0O408 


9783 


10007 








rpIC 


b3320 


RpIC ' 








E1M10000106G02 


32 


EcXA069g 


rpIC 


b3320 


RpIC 


AE000408 


9814 


10154 


E1M10000146H01 


33 


EcXA069h 


RplD; 


b33l9; 


RplD; 


AE000408 


9715 


9890 








rpIC 


b3320 


RpIC 








E1M10000148C02 


34 


EcXA069i 


RplD; 


b33I9; 


RplD; 


AE000408 


9740 


9980 








rpIC 


b3320 


RpIC 








P328-20.P20 


35 


EcXA070 


YbcQ 


b0551 


YbcQ 


AE000160 


7883 


7661 


1065-12 


36 


EcXA071 


ffh 


b2610 


Ffh 


AE000346 


11978 


12129 














AE000347 


1 


319 


E1M10000101D6 


37 


EcXA071b 


RpsP; 


b2609; 


RpsP; 


AE000346 


11911 


12129 








ffh 


b2610 


ffh 




















AE000347 


1 


348 


P332-U.C20 


38 


EcXA072 


rccJ 


b2892 


recJ 


AE000372 


12047 


12144 














AE000273 


x 


108 


I P334-5.H2 


39 


EcXA073 


htrE 


b0139 


htrE 


AE000123 


5431 


5548 


P338-4.M21 


40 


EcXA073b 


htrE 


b0139 


htrE 


AE000123 


5447 


5593 


E1M10000119A04 


41 


EcXA073c 


htrE 


b0139 


htrE 


AE000123 


5419 


5642 


E1MI0000137C03 


42 


EcXA073d 


htrE; 


b0139; 


htrE; 


AE000123 


5414 


5919 








ecpD 


b0140 


ecpD 








E1MI0000124G03 


43 


EcXA073c 


htrE 


b0139 


htrE 


AE000123 


5332 


5515 


P334-8.L7 


44 


EcXA074 


yciR 


bl285 


yciR 


AE000226 


8045 


8371 


1053-37 


45 


EcXA074b 


yciR 


bl285 


yciR 


AE000226 


6079 


6293 


P335-3.J14 


46 


ECXA075 


SfinD 


b0532 


sfmD 


AE000159 


3235 


3115 


P335-8.H8 


47 


EcXA076 


mviM 


bl068 


mviM 


AE000207 


11140 


10983 














AE000208 


50 


1 


P342-3 


48 


EcXA077 


B2145 


b2145 


yeiS 


AE000303 


9025 


8831 
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Clone Name 


Seq 
ID 


Molecule 
No. 


Gene 
(NCBI) 


Gene 
(Blat- 
tner) 


Gene 
(Rodd) 


Contig 


Start 


Stop 


E1M10000I06GIO 


49 


EcXA077b 


B2145 


b2145 


yeiS 


AE000303 


9007 


8841 


E1M10000144F3 


50 


EcXA077c 


B2145 


b2145 


yeiS 


AE000303 


9052 


8827 


X3S177-4 


51 


EcXA078 


ycgB 


bl 188 


ycgB 


AE000217 


3945 


4129 


P317-2.A3 


52 


EcXA079 


yedV 


bl968 


yedX 


AE000288 


5289 


5475 


E1M10000I51C04 


53 


EcXA079b 


yedV 


b!968 


yedV 


AE000288 


5179 


5515 


E1M10000162G05 


54 


EcXA079c 


yedV 


bl968 


yedV 


AE000288 


5313 


5503 


E1M10000I67F04 


55 


EcXA079d 


yedV 


bl968 


yedV 


AE000288 


5293 


5531 


E1MI0000167G04 


56 


EcXA079e 


yedV 


bl%8 


yedV 


AE000288 


5293 


5531 


X3S204-7 


57 


EcXA080 


rpIV 


b3315 


RplV 


AE000408 


7444 


7770 


E1M10000111C3 


58 


EcXA080b 


RplV; 


03315; 


RplV; 


AE000408 


7633 


7898 








rpsS 


b3316 


rpsS 








E1M10000I31B07 


59 


EcXA080c 


RplV; 


b33l5; 


RplV; 


AE000408 


7686 


7871 








rpsS 


b3316 


rpsS 








E1M1000O131CO7 • 


60 


EcXAOSOd 


RplV; 


b3315; 


RplV; - 


AE000408 


7723 


7860 








rpsS 


D3316 


rpsS 








E1M10000144G6 


61 


EcXA080e 


rpIV 


b3315 


RplV 


AE000408 


7580 


7762 


E1MI0000144C2 


62 


EcXA080f 


RplV; 


b3315; 


RplV; 


AE000408 


7650 


7784 








rpsS 


b3316 


rpsS 








E1M10000107G2 


63 


EcXA081 


rpsP 


b2609 


RpsP 


AE000346 


11957 


12097 


MC9.6 


64 


EcXA082 


hybC; 


D2994; 


HybC; 


AE000382 


4419 


4562 








hybB 


b2995 


hybB 








BI8-2.N2I 


65 


EcXA083 


hrpB 


b0148 


HrpB 


AE000124 


3024 


2955 


P336-14.F20 


66 


EcXA084 


B1399 


b!399 


PaaX 


AE000237 


164 


1 














AE000236 


12073 


12006 


985.P2I 


67 


EcXA085 


AgaZ; 


b3132; 


agaZ; 


AE000394 


lOt 11 


10705 








agaV 


03133 


agaV 








Z92-K24 


68 


EcXA086 


rplQ 


b3294 


RplQ 


AE000407 


7653 


8349 


E1M10000101CI2 


69 


EcXA086b 


rplQ 


b3294 


RplQ 


AE000407 


7748 


8075 


E1M10000103D11 


70 


EcXA086c 


RplQ 


b3294 


RplQ 


AE000407 


7652 


8051 


E1M10000I27D09 


71 


EcXA086d 


rplQ; 


b3294; 


RplQ; 


AE000407 


7806 


8129 








rpoA 


b3295 


rpoA 








E1M10000152D8 


72 


EcXA086e 


rplQ; 


b3294; 


ipiQ; 


•AE000407 


7950 


8146 








rpoA 


b3295 


rpoA 








SC17.1 


73 


EcXA087 


YehW 


b2128 


YehW 


AE000302 


915 


1226 


SC2I.I 


74 


EcXA088 


RpIO 


b330l 


RpIO 


AE000408 


1743 


1907 


E1M10000107G8 


75 


EcXA089 


YadT 


b0158 


YadT 


AE000125 


4489 


4639 


fclMlUUUUl IjL-o 


76 


CnVAnQA 

bcAAuyu 


unac. 


DU1 tv* 




AFflnni?7 

r\EAJ\J\Jt£ 1 


11T70U 


IvOJU 


EIML0000107B2 


77 


ECXA091 


YkgE 


b0306 


YkgE 


AE000137 


9375 


9261 


E1M10000I07C3 


78 


EcXA092 


bl497 


bl497 


YdcM 


AE000247 


689 


908 


E1M10000107H9 


79 


EcXA093 


YohM 


b2106 


YohM 


AE000299 


9423 


9166 


E1MI0000I09A11 


80 


EcXA094 


YfjW 


b2642 


YfjW 


AE000349 


7160 


6851 
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Clone Name 


Seq 


Molecule 


Gene 


Gene 


Gene 


Contig 


Start 


Stop 




ID 


No. 


(NCBI) 


(BUt- 


(Rudd) 
















tner) 










E1M10000160D07 


81 


EcXA094b 


YfjW 


b2642 


YfjW 


AE000349 


7118 


6932 


E1M10000161A05 


82 


EcXA094c 


YfjW 


b2642 


YijW 


AE000349 


6381 


5980 


E1M10000155A06 


83 


EcXA094d 


YfjW 


b2642 


YfjW 


AE000349 


6893 


6749 


E1M10000111A7 


84 


EcXA095 


b2758 


b2758 


YgcJ 


AE000359 


4983 


5069 


E1M10000107E2 


85 


EcXA096 


YgcM; 


b2765; 


ygcM; 


AE0O036O 


5320 


5190 








ygcN 


b2766 


ygcN 








E1M10000115E3 


86 


EcXA097 


yhcB 


b3233 


YhcB 


AE000402 


8070 


7864 


EIM10000107B3 


87 


EcXA097b 


yhcB; 


b3233; 


yhcB; 


AE000402 


8168 


7922 








degQ 


b3234 


degQ 








EiM 100001 62F03 


88 


EcXA097c 


yhcB 


b3233 


yhcB 


AE000402 


8111 


7874 


E1M10000127H07 


89 


EcXA097d? 


yhcB 


b3233 


yhcB 


AE000402 


8092 


7808 


EIM10000I63C04 


90 


EcXA097e 


yhcB; 


D3233; 


yhcB; 


AE000402 


8159 


7874 








degQ? 


b3234 


degQ 








EIM10000115G2 


91 


EcXA098 


rpoA 


b3295 


RpoA 


AE000407 


8254 


8453 


E1M10000144A8 


92 


EcXA098b 


RplO; 


b3294; 


RplQ; 


AE000407 


7841 


8118 








rpoA 


b3295 


rpoA 








E1MI0000101H9 


93 


EcXA099 


RpsN; 


b3307; 


RpsN; 


AE000408 


4403 


4826 








rplE 


b3308 


RplE 








E1M100001I1F9 


94 


EcXAlOO 


RpmH; 


b3703; 


RpmH; 


AE000447 


7555 


7395 








mpA 


b3704 


RnpA 








E1M10000119D02 


95 


EcXAlOOb 


rpmH; 


b3703; 


RpmH; 


AE000447 


7581 


7395 








rnpA 


b3704 


RnpA 








EIM10000106F05 


96 


EcXAlOOc 


rpmH; 


b3703; 


RpmH; 


AE000447 


7594 


7359 








rnpA 


b3704 


RnpA 








E1M10000152H8 


97 


EcXAlOOd 


RpmH; 


b3703; 


RpmH; 


AE000447 


7630 


7340 








mpA 


b3704 


RnpA 








E1M10000115H1 


98 


EcXAlOl 


yihK 


03871 


TypA 


AE000462 


8811 


8629 


E1M10000101H7 


99 


EcXA102 


adiY 


b4H6 


AdiY 


AE000484 


1980 


2171 


E1M10000109A02 


100 


EcXA103 


yjhB 


b4279 


YjhB 


AE000498 


8776 


8536 


E1M10000113A11 


101 


EcXA104 


hsdS 


b4348 


HsdS 


AE000505 


6319 


6495 


EIM10000I25A2 


102 


EcXAI04b 


hsdS 


b4348 


HsdS 


AE000505 


6277 


6526 


E1M10000I03A5 


103 


EcXA105 


ydaU 


bl359 


YdaU 


AE000233 


4497 


4306 


E1M10000135B2 


104 


EcXA106 


ybbV 


b0510 


YbbV 


AE000157 


3796 


3624 


EIM1(XK)0131G10 


105 


EcXA106 


ybbV 


b0510 


YbbV 


AE000157 


3796 


3624 


EIM100001I0A12 


106 


EcX A 1 07 


yegO 


b2076 


YegO 


AE000297 


14471 


14330 


EIM10000H0E9 


107 


EcXA108 


yigK 


b3824 


YigK 


AE000458 


3709 


3964 


E1M1OOOO133A06 


108 


EcXA109 


modC 


b0765 


ModC 


AE000179 


2414 


2180 


E1M10OOO133B08 


109 


EcXAllO 


ynaF; 


b!376; 


YnaF; 


AE000234 


8011 


8149 








bl377 


bl377 


OmpN 








E1M10000106E09 


110 


EcXAIlOb 


ynaF; 


M376- 


YnaF; 


AE000234 


7967 


8207 








bl377 


b!377 


OmpN 
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Clone Name 


Seq 


Molecule 


Gene 


Gene 


Gene 


Contig 


Start 


Stop 




ID 


No. 


(NCB1) 


(Blat- 


(Rudd) 
















tner) 










E1M10000I60G07 


111 


EcXAllQc 


ynaF; 


bl376; 


YnaF; 


AE000234 


7990 


8114 








bl377 


bl377 


OmpN 








SC13.1 


112 


EcXAllOd 


ynaF 


bl376 


YnaF 


AE000234 


8027 


8243 


E1M10000155B05 


113 


EcXAIlOe 


ynaF; 


bl376; 


YnaF; 


AE000234 


7992 


8139 








D1377 


bl377 


ompN 








E1M10000133D09 


114 


EcXAllI 


ppdA 


b2826 


PpdA 


AE000366 


4876 


5068 


E1M10000162B08 


115 


EcXAlllb 


ppdA 


b2826 


PpdA 


AE000366 


4968 


5084 


E1M10000133E01 


116 


EcXA112 


yrfF 


b3398 


YrfF 


AE000415 


5835 


5712 


E1M10000101A7 


117 


EcXAU3 


ybbQ; 


bO509; 


YbbQ; 


AE000I57 


3753 


3466 








ybbV 


b0510 


ybbV 








E1M10000131F04 


118 


EcXA113b 


ybbQ; 


b0509; 


YbbQ; 


AE000157 


3781 


3536 








ybbV 


b0510 


ybbV 








E1M10000I59A09 


119 


EcXA113c 


ybbQ; 


b0509; 


YbbQ; 


AE000157 


3781 


3257 








ybbV 


b0510 


ybbV 








E1M10000166F09 


120 


EcXAU3d 


ybbV 


D0510 


YbbV 


AE000157 


3784 


3624 


E1MI0000121E07 


121 


EcXAI14 


b2352; 


b2352; 


YfdH; 


AE000323 


10110 


9882 








b2353 


b2353 


yfdl 




















AE000324 


357 


1 


E1M10000121F06 


122 


EcXA115 


ygeF 


b2850 


YgeF 


AE000369 


570 


304 


E1M10000I40B05 


123 


EcXA115b 


ygcF 


b2850 


YgeF 


AE000369 


512 


312 


E1M10000148H09 


124 


EcXAll5c 


ygeF 


b2850 


YgeF 


AE000369 


607 


361 


E1M10000I64A02 


125 


EcXA115d 


ygcF 


b2850 


YgcF 


AE000369 


555 


411 


E1M100O0I2IG05 


126 


EcXA116 


insB_3 


b0021 




AE000135 


5258 


5726 


E1M10000136D3 


127 


EcXAlH 


rhsA 


b3593 


RhsA 


AE000437 


4125 


3529 



EXAMPLE 4 

Identification of Genes and their Corresponding Operons Affected by Antisense Inhibition 
The sequencing of the entire E. coli genome is described in Blattner et al., Science 277:1453- 
5 . 1474(1997) and the sequence of the genome is listed in GenBank Accession No.U00096. The 
operons to which the proliferation-inhibiting nucleic acids correspond were identified using 
RegulonDB and information in the literature. The coordinates of the boundaries of these operons on 
the E. coli genome are listed in Table III. Table II lists the molecule numbers of the inserts containing 
the growth inhibiting nucleic acid fragments, the genes in the operons corresponding to the inserts, the 
10 SEQ ID NOs of the genes containing the inserts, the SEQ ID NOs of the proteins encoded by the 
genes, the start and stop points of the genes on the E. coli genome, the orientation of the genes on the 
genome, whether the operons are predicted or documented, and the predicted functions of the genes. 
The identified operons, their putative functions, and whether or not the genes are presently thought to 
be required for proliferation are discussed below. 
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Functions for the identified genes were determined by using either Blattner functional class 
designations or by comparing identified sequence with known sequences in various databases. A 
variety of biological functions were noted for the genes to which the clones of the present invention 
correspond. The functions for the genes of interest appear in Table n. 
5 The proteins that are listed in Table II are involved in a wide range of biological functions. 



TABLE II 

All Operon Data with Whole Chromosome Coordinates 



Molecule 
Number 


Gene 


Seq ID 

No, 
(gene) 


Seq ID 

No. 
(protein) 


Start 


Stop 


Operon 


Blattner Functional 

Class 


Predicted Function 


EcXA056 


ycfS 


128 


299 


1168635 


1169597 


predicted 


hypothetical, unclassified, 
unknown 




EcXA057 


arp 


129 


300 


4217880 


4220066 


predicted 
operon 


Fatty acid and 
phospholipid metabolism 


ankyrin repeat protein 


EcXA058 


rpsQ 


130 


301 


3445951 


3446205 


documented 


Translation, post- 
trans] at ional modification 






rpmC 


131 


302 


3446205 


3446396 




Translation, post- 
translational modification 






rplP 


132 


303 


3446396 


3446806 




Translation, post- 
translational modification 






rpsC 


133 


304 


3446819 


3447520 




Translation, post- 
translational modification 






rplV 


134 


305 


3447538 


3447870 




Translation, post- 
translationa) modification 






rpsS 


135 


306 


3447885 


3448163 




Translation, post- 
translational modification 






rp!B 


136 


307 


3448180 


3449001 




Translation, post- 
translational modification 


translation 




rplW 


137 


308 


3449019 


3449321 




Translation, post- 
translatiortal modification 


translation 




rplD 


138 


309 


3449318 


3449923 




Translation, post- 
translational modification 






rplC 


139 


310 


3449934 


3450563 




Translation, post- 
translational modification 






rpsJ 


140 


311 


3450596 


3450907 




Translation, post- 
translational modification 




EcXA059 


ypjA 


141 


312 


2776167 


2780876 


predicted 
operon 


Translation, post- 
translational modification 




ECXA060 


rpmj 


142 


313 


3440255 


3440371 


documented 


Translation, post- 
translational modification 






prlA 
rplO 


143 
144 


314 
315 


3440403 
3441742 


3441734 
3442176 




Putative transport proteins 
Translation, post- 
translatbnal modification 






rpmD 


145 


316 


3442180 


3442359 




Translation, post- 
translational modification 






rpsE 


146 


317 


3442363 


3442866 




Translation, post- 
translational modification 
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Seq ID 

No. 
(gene) 


Seq ID 

No. 
(protein) 


Start 


Stop 


Operon 


Blattner Functional 
Class 


Predicted Function 




rpIR 


147 


318 


3442881 


3443234 




Translation, post- 


















trandattonal modification 






rplF 


148 


319 


3443244 


3443777 




Translation* post- 


translation 
















translational modification 






rpsH 


149 


320 


3443790 


3444182 




Translation, post- 


















translationa] modification 






rpsN 


150 


321 


3444216 


3444521 




Translation, post- 


















translational modification 






rplE 


151 


322 


3444536 


3445075 




Translation, post- 


translation 
















translational modification 






rplX 


152 


323 


3445090 


3445404 




Translation, post- 


















translational modification 






rpIN 


153 


T1A 

324 


1AACA 1 C 

3445415 


1AA 

3443786 




Translation, post- 


















translational modification 




EcXA061 


malE 


154 


325 


4242808 


4243998 


documented 


Transport and binding 


















proteins 






malF 


155 


326 


4241110 


4242654 




Transport and binding 




• 














proteins 






malG 


156 


327 


4240205 


4241095 




Transport and binding 


















proteins 




ECAAUOZ 


rep 


1 j t 








preuicicu 


riW A r*>«l if* nt inn 

l/in/\ replication, 
















operon 


iv com o i null un, 


















mruHifl/vitirtn an/1 ronair 

inuuiitvaiiun ailU repair 




CbAAVQj 


52269 


158 




2380733 


2381944 




9t it nti \t& pnTvnwc 

ruioiivc ciizyiiK>s 


puutuvc 














operon 






EcXA064 


cyoE 


159 


330 


446039 


446929 


documented 


Energy metabolism 






cyoA 


160 


331 


449887 


450834 




Energy metabolism 






cyoB 


161 


332 


447874 


449865 




Energy metabolism 






cyoC 


162 


333 


447270 


447884 




Energy metabolism 






cvoD 


163 


334 


446941 


447270 




Energy metabolism 




EcXA065 


dgoA 


164 


335 


3869477 


3871240 


predicted 


Carbon compound 


2-Oxo-3- 
















catabolism 


deoxygalactonate 6- 


















phosphate aldolase 




dgoK 


165 


336 


3871224 


3872401 




Carbon compound 


















catabolism 


* 




yidW 


166 


337 


3872401 


3872787 




Hypothetical, 


















UnvlaaMlICUj UillUIUWIl 






b3694 


167 


338 


3872099 


3872395 




Putative regulatory 


















protein 




EcXA066 


rpIT 


168 


339 


1797417 


1797773 


documented 


Translation, post- 


















translational modification 






rpml 


169 


340 


1797826 


1798023 




Translation, post- 


















translational modification 






in/C 


170 


341 


1798120 


1798662 




Translation, post- 


translation 
















translational modification 






thrS 


171 


342 


1798666 


1800594 




Translation, post- 


















translational modification 




EcXA067 


xylF 


172 


343 


3728760 


3729752 




Transport and binding 


















proteins 




EcXA068 


yhfL 


173 


344 


3497085 


3497252 


predicted 


Hypothetical, 
















operon 


unclassified, unknown 
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Gcdc 


ScqlD 

No. 
(g«e) 


SeqlD 

Na 
(protein) 


Start 


Stop 


Opera □ 


Blattacr Functional Predicted Fuactkm 
Class 




yhfM 


174 


345 


3497496 


3498884 


predicted 


Putative Transport 














operon 






yhfN 


175 


346 


3498884 


3499927 




Putative Transport 




yhfO 


176 


347 


3499890 


3500339 




Hypothetical, 
















imrlnccifiprf unknown 


EcXA069 




x 


x 






same operon 
















as EcXA058 




EcXA070 


ybcQ 


177 


348 


573179 


573562 


predicted 


Hypothetical, 














operon 


unclassified, unknown 


EcXA071 


fm 


178 


349 


2744454 


2745815 


predicted 


Transport and binding 














operon 


proteins 


EcXA072 


rccJ 


179 


350 


3034393 


3036126 


predicted 


Transcription, RNA 














operon 


processing and 
















degradation 




dsbC 


180 


351 


3036132 


3036842 




Cell structure 




xerD 


181 


352 


3036867 


3037763 




DNA replication, 
















recombination, 
















modification and repair 


EcXA073 


ccpD 


182 


353 


155461 


156201 


documented 


Cell structure 




htrE 


183 


354 


152829 


155426 




Cell structure 


hCAAU ( «♦ 


yciR 


1 OA 


J55 


1 1AT7B1 
1 J42/OI 


IJ^rf f DO 


predicted 


nyporjieti cat, 














operon 


unclassified, unknown 


CCAAU ij 


simL. 


Ioj 


JDO 


CC01Q7 


CCQDOQ 

JDaooy 


predicted 


r uuuivc cnapcronin 














operon 






stmU 


186 


357 


CCQOIft 

558920 


561523 




LCll structure 




sfinH 


187 


358 


561565 


562542 




Cell processes (incl. 
















adaptation, protection) 




sfmF 


188 


359 


562553 


563068 




Cell processes (incl. 
















aaapuuiQn, pruiecuun j 


EcXA076 


rimJ 


189 


360 


! 124785 


1 125369 


predicted 


Translation, post- j 














operon 


translational modification 




yccH 


190 


361 


1125380 


1 126027 




Hypothetical, 
















unclassified, unknown 




mviM 


191 


362 


1126029 


1 126952 




Cell processes (incl. 
















adaptation, protection) 


EcXA077 


sanA 


192 


363 


2230898 


2231617 


predicted 






b2145 


193 


364 


2231620 


2231859 




Hypothetical, 
















unclassified, unknown 


EcXA078 


ycgB 


194 


365 


1234932 


1236464 


predicted 


Hypothetical, 














operon 


unclassified, unknown 


EcXA079 


ycdV 


195 


366 


2034816 


2036174 


predicted 


Hypothetical, 
















unclassified, unknown 




ycdW 


196 


367 


2036174 


2036893 




Hypothetical, 
















unclassified, unknown 


ECXA080 




X 


X 






same operon 
















asEcXA058 




EcXA081 


b2107 


197 


368 


2184800 


2185318 


predicted 


Hypothetical, 
















unclassified, unknown 




b2!06 


198 


369 


2183937 


2184761 




Hypothetical, 
















unclassified, unknown 


EcXA082 


bybG 


199 


370 


3137731 


3137979 


documented 


Energy metabolism 




hybF 


200 


371 


3137992 


3138333 




Energy metabolism 




hybE 


201 


372 


3138326 


3138814 




Energy metabolism ! 
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Molecule 
Number 


Gene 


ScqlD 

No. 
(gene) 


SeqID 

No. 
(protein) 


Start 


Stop 


Operoo 


BUttner Functional 
Class 


Predicted Function 




hybD 


202 


373 


3138807 


3139301 




Energy metabolism 






hybC 


203 


374 


3139301 


3141004 




Energy metabolism 






hybB 


204 


375 


3141001 


3142179 




Energy metabolism 






hybA 


205 


376 


3142169 


3143155 




Energy metabolism 




EcXA083 


hipB 


206 


377 


162060 


164534 




Transcription, RNA 


















processing and 


















degradation 




EcXA084 


bl399 


207 


378 


1461563 


1462513 




Hypothetical, 


















unutuojiicu, luUuiuwn 






bl400 


208 


379 


1462495 


1463085 




Hypothetical, 


















unclassified, unknown 




ECXA085 


agaZ 


209 


380 


3276555 


3277835 


predicted 


Carbon compound 


putative tagaiose Di- 
















catabolism 


phosphate kinase 2 




agaV 


210 


381 


3277822 


3278331 




Central intermediary 


r 1 2> system, (bllo- 
















metabolism 


AGA) 




agaW 


211 


382 


3278342 


3278743 




Central intermediary 


PTS system (EIIC) 
















metabolism 






agaA 


212 


383 


3278763 


3279266 




Central intermediary . 


putative N-NAG-6- 
















metabolism 


phosphatedeacetylase 




agaS 


213 


384 


327961 7 


3280771 




Central intermediary 


putative tagalosc-6- 
















metabolism 


phosphate 


















aldose/ketose 


















isomerase 




agaY 


214 


385 


3280784 


3281644 




Central intermediary 


tagatose-bisphosphate 
















metabolism 


aldolase 2 


EcXA086 


rpsM 


215 


386 


3439752 


3440108 


documented 


Translation, post- 


















translational modification 






rpsK 


216 


387 


3439346 


3439735 




Translation, post- 


















translationaJ modification 






rpsD 


217 


388 


3438692 


3439312 




Translation, post- 


















translational modification 






rpoA 


218 


389 


3437677 


3438666 




Translation, post* 


















translation al modification 






rplQ 


219 


390 


3437253 


3437636 




Translation, post- 


















translation al modification 




EcXA087 


yehW 


220 


391 


2213765 


2214496 


predicted 


Hypothetical, 


















unclassified, unknown 






yehX 


221 


392 


2214501 


2215427 




Hypothetical, 


















unclassified, unknown 






yehY 


222 


393 


2215420 


2216577 




Hypothetical, 


















unclassified, unknown 






yehZ 


223 


394 


2216584 


2217501 




Hypothetical, 


















unclassified, unknown 




EcXA088 




X 


X 






same as 
























ECXA089 


yadS 


224 


395 


177662 


178462 


predicted 


Hypothetical, 
















operon 


unclassified, unknown 






yadT 


225 


396 


177662 


178462 




Hypothetical, 


















unclassified, unknown 






. p 6 


226 


397 


178455 


179153 
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MolecBlc 
Number 


Gene 


Scq ID 

No. 
(gene} 


Scq ID 

No. 
(protein) 


Start 


Stop 


Operon 


BUttner Functional 
Class 


Predicted Function 


EcXA090 


IpxA 


227 


398 


202560 


203348 


predicted 


















operon 








tpxB 


228 


399 


203348 


204496 










mhB 


229 


400 


204493 


205089 






RnaaseH2 




dnaE 


230 


401 


205126 


208608 






DNA pol in subunit 


EcXA091 


vksE 


231 


402 


320832 


321551 


predicted 


Hypothetical, 


















unclassified, unknown 






vkeF 


232 


403 


321562 


322989 




Hypothetical 


















unclassified, unknown 








233 


404 


322829 


323677 




Hypothetical, 


















unclassified, unknown 




EcXA092 


b!497 


234 


405 


1577657 


1578829 




Hypothetical, 


















tinclastifiVH unkmurn 






bl498 


235 


406 


1 578866 


1580581 




Hvnothetical 


















unclassi ficd imknnwn 




EeXA093 

LV.AAU7J 


yohM 


236 


407 


2183937 


2184761 


nredieted 


Hvnoth eri cftl 


















unclassified unknown 






b2107 


237 


408 


2184800 


2185318 




Hvnothetical 


















unclassified, unknown 




EcXA094 


vfiW 


238 


409 


2771339 


2773042 


nredicted 


Hypothetical, 


















unclassified, unknown 






52758 


239 


410 


2879074 


2880165 


nrerlicted 


H v no I h Pl i cfll 


















unclassified, unknown 




EcXA096 




240 


41 1 


2890237 


2890602 


predicted 


Hypothetical, 


















unclassified, unknown 






ygcN 


241 


412 


2890650 


2891951 




Hypothetical, 


















unclassified, unknown 






b2767 


242 


413 


2891906 


2892202 




Hypothetical, 


















unclassified, unknown 






b2768 


243 


414 


2892219 


2892794 




Hypothetical, 


















unclassified, unknown 




FcYA097 


yhcB 


244 


415 


3377820 


3378224 


nrrHirtprl 


Hvrwthptiral 
















operon 


unclassified, unknown 








74^ 


416 


J J / OJ to 


1379745 


picuiucu 


Tmnclatlnn rvict- 
1 lallalaimilj pUM~ 
















operon 


translation al modification 






hhoB 


246 


417 


3379835 


3380902 






















translational modification 




FcXA098 


rpsM 


747 


418 


3439752 


3440108 


rtn r i im p n t pH 


Translation nnct- 

1 1 all dial lUli, Uvdi* 


















translational modification 






rpsK 


248 


419 


3439346 


3439735 




Translation, post- 


















translattonal modification 






rpsD 


249 


420 


3438692 


3439312 




Translation, post- 


















transIationaJ modification 






ipoA 


250 


421 


3437677 


3438666 




Translation, post- 


















translational modification 






rplO 


251 


422 


3437253 


3437636 




Translation, post- 


















translational modification 




hcXAUVV 




X 


X 






same as 


















EcXA060 






EcXAlOO 


rpmH 


252 


423 


3881965 


3882105 


documented 


Translation, post- 


















translational modification 






rnpA 


253 


424 


3882122 


3882481 




DNA replication, 


















recombination, 


















modification and repair 
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Molecule 
Number 


Gene 


Seq ID 

No- 
te*™) 


Seq ID 

No, 
(protein) 


Start 


Stop 


Operon 


wanner ¥ u actional 
Class 


Predicted r unction 


EcXAIOl 


yihK 


254 


425 


4055987 


4057762 


predicted 


Hypothetical, 
















operon 


unclassified, unknown 




EcXA102 


adi 


255 


426 


4335832 


4338102 


documented 


Putative regulatory 


biodegradative acid- 
















proteins 


induced arginine 


















decarboxylase 




adiY 




427 


4334746 


4335507 




Ammo nc\A hioevntheerc 


















and metabolism 




EcXAI03 


yjhB 


257 


428 


4501566 


4502843 


predicted 


Hypothetical, 
















operon 


unclassified, unknown 






yjhC 


258 


429 


4502840 


4503973 




Hypothetical, 


















unclassified, unknown 




EcXAI04 


hsdS 


259 


430 


4577638 


4579032 


documented 


DNA replication, 


host specificity 
















recombination, 


















modification and repair 






hsdM 


260 


431 


4579029 


4580618 




DNA replication, 


















recombination, 


















modification and repair 




EcXA105 


bl357 


261 


432 


1418389 


1418685 


predicted 


Hypothetical, 


















unclassified, unknown 






bl358 


262 


433 


1418708 


1419130 




Hypothetical, 


















unclassified, unknown 






ydaU 


263 


434 


1419143 


1420000 




Hypothetical, 


















unclassified, unknown 






bl360 


264 


435 


1420007 


1420753 




Hypothetical, 


















unclassified, unknown 






bl361 


265 


436 


1420725 


1421336 




Hypothetical, 


















unclassified, unknown 






bl362 


266 


437 


1421363 


1421668 




Hypothetical, 


















unclassified, unknown 




EcXA106 


ybbQ 


267 


438 


535810 


536688 


predicted 


Hypothetical, 


















unclassified, unknown 






ybbV 


268 


439 


536720 


536998 




Hypothetical, 


















unclassified, unknown 






bOSU 


269 


440 


536998 


538311 




Hypothetical, 


















unclassified, unknown 




EcXA107 


yegM 


270 


441 


2151891 


2153285 


predicted 


Hypothetical, 


















unclassified, unknown 






yegN 


271 


442 


2153285 


2156407 




Hypothetical, 


















unclassified, unknown 






yego 


272 


443 


2156408 


2159485 




Hypothetical, 


















unclassified, unknown 






ycgB 


273 


444 


2159486 


2160901 




Hypothetical, 


















unclassified, unknown 




EcXA108 




274 


445 


4006046 


4006462 




Hypothetical, 


















unclassified, unknown 




EcXA109 


modA 


275 


446 


794312 


795085 


documented 


Transport and binding 


molybdate uptake 
















proteins 


: 




modB 


276 


447 


795085 


795774 




Transport and binding 


















proteins 






modC 


277 


448 


795777 


796835 




Transport and binding 


















proteins 




EcXAIiO 


ynaF 


278 


449 


1433209 


1433715 


predicted 


Hypothetical, 


















unclassified, unknown 






bl377 


279 


450 


1433784 


1434917 


predicted 


Hypothetical, 


















unclassified, unknown 




EcXAlll 


recC 


280 


451 


2957082 


2960450 


predicted 


Transcription, RNA 


















processing and 
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Molecule 
Number 


Gene 


ScqlD 

No. 


Seq ID 

No. 
(pro too) 


Stirt 


Stop 


Operoo 


Blattner Functional 
Class 


Predicted Function 
















degradation 






ppdL 


281 


452 


2960463 


2960786 




Other known genes 


prepilin peptidase 


















dependent protein C 




ygdB 


282 


453 


2960771 


2961136 




Hypothetical, 


















unclassified, unknown 






ppdB 






XVOl 1 /J 


£701 /JO 




umer Known genes 


prep u ui pcpuuAc 


















dependent protein B 




ppdA 


284 


455 


2961729 


2962199 




Other known genes 


prepilin peptidase 


















dependent protein A 


P-va 1 17 

JCCA/\ 1 lx 


yrcr 


i-OJ 


456 


3524107 




nrf/iirt^rt 


Hvnnthptir^l 


















unclassified, unknown 






yrfG 


286 


457 


3526262 


3526975 




Hypothetical, 


















unclassified, unknown 






yrfH 


287 


458 


3526986 


3527387 




Hypothetical, 


















unclassified, unknown 






yrfl 


288 


459 


3527406 


3528290 




Hypothetical, 


















unclassified, unknown 




EcXA113 




X 


X 






same as 


















EcXA106 






c-YA 1 14 


o£jjV 






7465 J17 5 


7466717 


prcaiucu 


nypuuitnjcaj, 


















iinclfKcificd nnknnwn 

UIILICujIIIVU, UIUUIUVTII 






DZJjI 




HQ I 




74A7 1 54 




nypuuicucui, 


















unvioaaiiKU, luimiu vvn 






52352 


291 


462 


2467151 


2468482 




Hvoothetical 


















UllwliXSJlllCU, lUlKJHJvVM 








7Q7 




246878 1 


2469125 




Hv ivi th pt i est 1 


















UllUliual I Ivll, U1IWIUWII 




p r YA 1 1 * 
tt/W 1 1 J 


ygeF 






2988576 


2989022 


pi CU1CICU 


Hvmfhrtirjil 






















FrYAI 16 


tncR 1 
in5D_J 


704 


465 


289873 


290376 


nrrrlirt^rl 
prcxiincu 


nhaop tranennenn or 
piiagc, uaiispwuii, ui 


















nlscmtrl 

piasniiu 






insA 3 


295 


466 


290295 


290570 




phage, transposon, or 


















plasmid 




ECXA117 


rhsA 


296 


467 


3759810 


3763943 


predicted 


Hypothetical, 


















unclassified, unknown 






yibA 


297 


468 


3763964 


3763806 




Hypothetical, 


















unclassified, unknown 






yiW 


298 


469 


3764848 


3765549 




Hypothetical, 


















unclassified, unknown 





Functions for the identified genes were determined by using either Blattner functional class 
designations or by comparing identified sequence with known sequences in various databases. A 
variety of biological functions were noted for the genes to which the clones of the present invention 
5 correspond. Biological functions for genes that lie on the same operon as an identified gene have also 
been made. The functions for the genes of interest appear in Table n. 

The genes of interest have a variety of biological functions. For example, genes that are 
thought to function as transport or binding proteins, that participate in translation or post-translationa! 
modification, that are involved in carbon compound catabolism, that are thought to be enzymes, 

-39- 



WO 01/34810 PCT/US00/30950 

participate in cell processes, energy metabolism and biosynthetic functions appear in Table II. Genes 
that are involved in cell structure, transcription, RNA processing and degradation also appear in Table 
II. 

Several of the expression vectors contain fragments that correspond to genes of unknown 
5 function or if the function is known, it is not known whether the gene is essential. For example, 
EcXA056, 057, 059, 063, 064, 065, 067, 068, 070, 073, 074, 075, 076, 077, 078, 079, 081, 084, 085, 
087, 089, 091, 092, 093, 094, 095, 096, 097, 101, 102, 103, 105, 106, 107, 108, 109, 1 10, 111, 112, 
1 13, 1 14, 1 1 5 and 1 17 are all exogenous nucleic acid sequences that correspond to £ coli proteins that 
have no known function or where the function has not been shown to be essential or nonessential. 

1 0 The present invention reports a number of novel £ coli genes and operons that are required for 

proliferation. From the list of clone sequences identified here, each was identified to be a portion of a 
gene in an operon required for the proliferation of £ coli. Cloned sequences corresponding to genes 
already known to be required for proliferation in £ coli include EcXA058, 060, 066, 069, 071, 080, 
086, 088, 090, 098, 099 and 100 are exogenous nucleic acid sequences that correspond to £ coli genes 

15 that are known to be required for cellular proliferation. The remaining identified sequences correspond 
to £ coli genes previously undesignated as required for proliferation in the art 

An interesting observation of the present invention is that there are also several sequence 
fragments that correspond to £ coli genes that are not thought to be required for £ coli proliferation. 
Nevertheless, under the conditions described above, the antisense expression of these gene fragments 

20 causes a reduction in cell growth. This result implies that the genes corresponding to the identified 
sequences are actually required for proliferation or are in operons required for proliferation. Molecule 
Nos. corresponding to these genes are EcXA061, 062, 072, 082, 083, 104 and 1 16. 

Following identification of the sequences of interest, these sequences were localized into 
operons. Since bacterial genes are expressed in a polycistronic manner, the antisense inhibition of a 

25 singte gene in an operon might effect the expression of all the other genes on the operon or the genes 
down stream from the single gene identified. In order to determine which of the gene products in an 
operon are required for proliferation, each of the genes contained within an operon may be analyzed for 
their effect on viability as described below. 

30 



35 
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TABLE HI 
Operon Boundaries 



Molecule 
Number 


Start 


Stop 


EcXA056 


1168635 


1169597 


EcXA057 


4217880 


4220066 


EcXA059 


2776167 


2780876 


EcXA061 


4240205 


4243998 


EcXA062 


3958292 


3960313 


EcXA063 


2380733 


2381944 


EcXA064 


446039 


450834 


EcXA065 


3869477 


3872395 


EcXA067 


3728760 


3729752 


EcXA068 


3497085 


3500339 


EcXA070 


573179 


573562 


EcXA071 


2744454 


2745815 


EcXA072 


3034393 


3037763 


EcXA073 


152829 


156201 


EcXA074 


1342781 


1344766 


EcXA075 


558197 


563068 


EcXA076 


1124785 


1126952 


EcXA077 


2230898 


2231859 


EcXA078 


1234932 


1236464 


EcXA079 


2034816 


2036893 


EcXA08I 


2183937 


2185318 


EcXA082 


3137731 


3143155 


EcXA083 


162060 


164534 


EcXA084 


1461563 


1463085 


EcXA085 


3276555 


3280771 


EcXA086 


3437253 


3440108 


EcXA087 


2213765 


2217501 


EcXA089 


177662 


179153 


EcXA090 


202560 


208608 


EcXA091 


320832 


323677 


EcXA092 


1577657 


1580581 


EcXA093 


2183937 


2185318 


EcXA094 


2771339 


2773042 


EcXA095 


2879074 


2880165 


EcXA096 


2890237 


2892794 
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Molecule 
Number 


Start 


StOD 


PrYAflQ7 


1177820 


1180907 


FrYAflOR 

CCAAU/O 


1417251 


1418666 


FrXAlOO 


3881965 


1882481 


PrYAlOl 


4055987 

HUJJ70 # 


4057762 




4114746 


4118102 


CCAAIUJ 


4501 566 


4501071 

HJUJ7 / J 




4*177 Al ft 


4joUO 1 o 


P/»Y A 1 05 


14 1 K1SQ 


1471££B 


CCAnlUO 


JJJOlU 


51X11 1 


c-YA im 
uCAA IU / 


71^1 S0 1 

L \ J 1 07 I 


7 1 £ftQft 1 


p-ya inn 




4nnA4^7 

*fUUO*>OZ 


PrYA 1O0 


7041 1 7 


70A815 
/7O0J3 


PrYA 1 in 


14117ft0 


1414017 


P/» Y A 1 1 1 


70*I7ft»7 


70A7 1 00 


EcXAl 12 


3524107 


3528290 


EcXA114 


2465875 


2469125 


EcXAl 15 


2988576 


2989022 


EcXAl 16 


289873 


290570 


EcXAl 17 


3759810 


3765549 



EXAMPLES 

Identification of Individual Genes within an Operon Required for Proliferation 
5 The following example illustrates a method for determining which gene in an operon is 

required for proliferation. The clone insert corresponding to Molecule No. EcXA066 possesses 
nucleic acid sequence homology to the E. coii genes rplT and rpml. These genes are located in an 
operon containing two additional genes, infC and thrS. To determine which gene or genes in this 
operon are required for proliferation, each gene is selectively inactivated using homologous 
1 0 recombination. Gene rplT is the first gene to be inactivated. 

Deletion inactivation of a chromosomal copy of a gene in E, coli can be accomplished by 
integrative gene replacement The principle of this method (Hamilton, C. M., et al 1989. J, 
BacterioL 171: 4617-4622) is to construct a mutant allele of the targeted gene, introduce that allele 
into the chromosome using a conditional suicide vector, and then force the removal of the native 
15 wild type allele and vector sequences. This will replace the native gene with a desired mutation(s) 
but leave promoters, operators, etc. intact. Essentiality of a gene is determined either by deduction 
from genetic analysis or by conditional expression of a wild type copy of the targeted gene (trans 
complementation). 
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The first step is to generate a mutant rplT allele using PCR amplification. Two sets of 
PCR primers are chosen to produce a copy of rplT with a large central deletion to inactivate the 
gene. In order to eliminate polar effects, it is desirable to construct a mutant allele comprising an 
in-frame deletion of most or all of the coding region of the rplT gene. Each set of PCR primers is 
5 chosen such that a region flanking the gene to be amplified is sufficiently long to allow 
recombination (typically at least 500 nucleotides on each side of the deletion). The targeted 
deletion or mutation will be contained within this fragment. To facilitate cloning of the PCR 
product, the PCR primers may also contain restriction endonuclease sites found in the cloning 
region of a conditional knockout vector such as pK03 (Link, et al 1997 7. Bacteriol 179 (20): 

10 6228-6237). Suitable sites include NotI, Sail, BamHI and Smal. The rplT gene fragments are 
produced using standard PCR conditions including, but not limited to, those outlined in the 
manufacturers directions for the Hot Start Taq PCR kit (Qiagen, Inc., Valencia, CA). The PCR 
reactions will produce two fragments that can be fiised together. Alternatively, crossover PCR can 
be used to generate a desired deletion in one step (Ho, S. R, et al 1989. Gene 11: 5 1-59, Horton, R. 

15 M., et al 1989. Gene 11: 61-68). The mutant allele thus produced is called a "null" allele because it 
cannot produce a functional gene product. 

The mutant allele obtained from PCR amplification is cloned into the multiple cloning site 
of pK03. Directional cloning of the rplT null allele is not necessary. The pK03 vector has a 
temperature-sensitive origin of replication derived from pSClOl. Therefore, clones are propagated 

20 at the permissive temperature of 30°C. The vector also contains two selectable marker genes: one 
that confers resistance to chloramphenicol and another, the Bacillus subtilis sacB gene, that allows 
for counter-selection on sucrose containing growth medium. Clones that contain vector DNA with 
the null allele inserted are confirmed by restriction endonuclease analysis and DNA sequence 
analysis of isolated plasmid DNA. The plasmid containing the rplT null allele insert is known as a 

25 knockout plasmid. 

Once the knockout plasmid has been constructed and its sequence verified, it is transformed 
into a Rec* E. coli host cell. Transformation can be by any standard method such as 
electroporation. In some fraction of the transformed cells, plasmids will integrate into the £. coli 
chromosome by homologous recombination between the rplT null allele in the plasmid and the rplT 

30 gene in the chromosome. Transformant colonies in which such an event has occurred are readily 
selected by growth at the non-permissive temperature of 43°C and in the presence of 
chloramphenicol. At this temperature, the plasmid will not replicate as an episome and will be lost 
from cells as they grow and divide. These cells are no longer resistant to chloramphenicol and will 
not grow when it is present. However, cells in which the knockout plasmid has integrated into the 

35 E. coli chromosome remain resistant to chloramphenicol and propagate. 

Cells containing integrated knock-out plasmids are usually the result of a single crossover 
event that creates a tandem repeat of the mutant and native wild type alleles of rplT separated by the 
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vector sequences. A consequence of this is that rplT will still be expressed in these cells. In order 
to determine if the gene is essential for growth, the wild type copy must be removed This is 
accomplished by selecting for plasmid excision, a process in which homologous recombination 
between the two alleles results in looping out of the plasmid sequences. Cells that have undergone 
5 such an excision event and have lost plasmid sequences including sacB gene are selected for by 
addition of sucrose to the medium. The sacB gene product converts sucrose to a toxic molecule. 
Thus counter selection with sucrose ensures that plasmid sequences are no longer present in the 
cell. Loss of plasmid sequences is further confirmed by testing for sensitivity to chloramphenicol 
(loss of the chloramphenicol resistance gene). The latter test is important because occasionally a 

10 mutation in the sacB gene can occur resulting in a loss of sacB function with no effect on plasmid 
replication (Link, et al., 1997 J. Bacteriol 179 (20): 6228-6237). These artifact clones retain 
plasmid sequences and are therefore still resistant to chloramphenicol. 

In the process of plasmid excision, one of the two rplT alleles is lost from the chromosome 
along with the plasmid DNA. In general, it is equally likely that the null allele or the wild type 

15 allele will be lost. Therefore, if the rplT gene is not essential, half of the clones obtained in this 
experiment will have the wild type allele on the chromosome and half will have the null allele. 
However, if the /p/Tgene is essential, cells containing the null allele will not be obtained as a single 
copy of the null allele would be lethal. 

To determine the essentiality of rplT, a statistically significant number of the resulting 

20 clones, at least 20, are analyzed by PCR amplification of the rplT gene. Since the null allele is 
missing a significant portion of the rplT gene, its PCR product is significantly shorter than that of 
the wild type gene and the two are readily distinguished by gel electrophoretic analysis. The PCR 
products may also be subjected to sequence determination for further confirmation by methods well 
known to those in the art. 

25 The above experiment is generally adequate for determining the essentiality of a gene such 

as rplT. However, it may be necessary or desirable to more directly confirm the essentiality of the 
gene. There are several methods by which this can be accomplished. In general, these involve 
three steps: 1) construction of an episome containing a wild type allele, 2) isolation of clones 
containing a single chromosomal copy of the mutant null allele as described above but in the 

30 presence of the episomal wild type allele, and then 3) determining if the cells survive when the 
expression of the episomal allele is shut off. In this case, the trans copy of wild type rplT is made 
by PCR cloning of the entire coding region of rplT and inserting it in the sense orientation 
downstream of an inducible promoter such as the £. coll lac promoter. Transcription of this allele 
of rplT will be induced in the presence of IPTG which inactivates the lac repressor. Under IPTG 

35 induction rplT protein will be expressed as long as the recombinant gene also possesses a ribosomal 
binding site, also known as a "Shine-Dalgarno Sequence". The trans copy of rplT is cloned on a 
plasmid that is compatible with pSClOl. Compatible vectors include pl5A, pBR322, and the pUC 
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plasmids, among others. Replication of the compatible plasmid will not be temperature-sensitive. 
The entire process of integrating the null allele of rplT and subsequent plasmid excision is carried 
out in the presence of IPTG to ensure the expression of functional rplT protein is maintained 
throughout. After the null rplT allele is confirmed as integrated on the chromosome in place of the 
5 wild type rplT allele, then IPTG is withdrawn and expression of functional rplT protein shut off. If 
the rplT gene is essential, cells will cease to proliferate under these conditions. However, if the 
rplT gene is not essential, cells will continue to proliferate under these conditions. In this 
experiment, essentiality is determined by conditional expression of a wild type copy of the gene 
rather than inability to obtain the intended chromosomal disruption. 

10 An advantage of this method over some other gene disruption techniques is that the targeted 

gene can be deleted or mutated without the introduction of large segments of foreign DNA. 
Therefore, polar effects on downstream genes are eliminated or minimized. There are methods 
described to introduce inducible promoters upstream of potential essential bacterial genes. 
However in such cases, polarity from multiple transcription start points can be a problem. One way 

15 of preventing this is to insert a gene disruption cassette that contains strong transcriptional 
terminators upstream of the integrated inducible promoter (Zhang, Y, and Cronan, J. E. 1996 J. 
Bacteriol 178 (12): 3614-3620). The described techniques will all be familiar to one of ordinary 
skill in the art. 

Following the analysis of the rplT gene, the other genes of the operon are investigated to 
20 determine if they are required for proliferation. 

EXAMPLE 6 

Expression of the Proteins Encoded bv Genes Identified as Required for E. coli Proliferation 
The following is provided as one exemplary method to express the proliferation-required 
proteins encoded by the identified sequences described above. First, the initiation and termination 

25 codons for the gene are identified. If desired, methods for improving translation or expression of the 
protein are well known in the art. For example, if the nucleic acid encoding the polypeptide to be 
expressed lacks a methionine codon to serve as the initiation site, a strong Shine-Delgamo sequence, or 
a stop codon, these sequences can be added. Similarly, if the identified nucleic acid sequence lacks a 
transcription termination signal, this sequence can be added to the construct by, for example, splicing 

30 out such a sequence from an appropriate donor sequence. In addition, the coding sequence may be 
operably linked to a strong promoter or an inducible promoter if desired. The identified nucleic acid 
sequence or portion thereof encoding the polypeptide to be expressed is obtained by PCR from the 
bacterial expression vector or genome using oligonucleotide primers complementary to the identified 
nucleic acid sequence or portion thereof and containing restriction endonuclease sequences for Nco\ 

35 incorporated into the 5' prima* and BglU at the 5' end of the corresponding 3 '-primer, taking care to 
ensure that the identified nucleic acid sequence is positioned in frame with the termination signal. The 
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purified fragment obtained from the resulting PCR reaction is digested with Ncol and BglU, purified 
and ligated to an expression vector. 

The ligated product is transformed into DH5ot or some other E. coli strain suitable for the over 
expression of potential proteins. Transformation protocols are well known in the art. For example, 
5 transformation protocols are described in: Current Protocols in Molecular Biology, Vol. 1, Unit 1 .8, 
(Ausubel, et al., Eds.) John Wiley & Sons, Inc. (1997). Positive transform ants are selected after 
growing the transformed cells on plates containing 50-100 ug/ml Ampicillin (Sigma, St Louis, 
Missouri). In one embodiment, the expressed protein is held in die cytoplasm of the host organism. In 
an alternate embodiment, the expressed protein is released into the culture medium. In still another 

10 alternative, the expressed protein can be sequestered in the periplasmic space and liberated therefrom 
using any one of a number of cell lysis techniques known in the art. For example, the osmotic shock 
cell lysis method described in Chapter 16 of Current Protocols in Molecular Biology, Vol. 2, 
(Ausubel, et al., Eds.) John Wiley & Sons, Inc. (1997). Each of these procedures can be used to 
express a proliferation-required protein. 

15 Expressed proteins, whether in the culture medium or liberated from the periplasmic space or 

the cytoplasm, are then purified or enriched from the supernatant using conventional techniques such as 
ammonium sulfate precipitation, standard chromatography, immunoprecipitation, 
immunochromatography, size exclusion chromatography, ion exchange chromatography, and HPLC. 
Alternatively, the secreted protein can be in a sufficiently enriched or pure state in the supernatant or 

20 growth media of the host to permit it to be used for its intended purpose without further enrichment. 
The purity of the protein product obtained can be assessed using techniques such as Coomassie or 
silver staining or using antibodies against the control protein. Coomassie and silver staining techniques 
are familiar to those skilled in the art 

Antibodies capable of specifically recognizing the protein of interest can be generated using 

25 synthetic peptides using methods well known in the art. See, Antibodies: A Laboratory Manual, 
(Harlow and Lane, Eds.) Cold Spring Harbor Laboratory (1 988). For example, 1 5-mer peptides having 
a sequence encoded by the appropriate identified gene sequence of interest or portion thereof can be 
chemically synthesized. The synthetic peptides are injected into mice to generate antibodies to the 
polypeptide encoded by the identified nucleic acid sequence of interest or portion thereof. 

30 Alternatively, samples of the protein expressed from the expression vectors discussed above can be 
purified and subjected to amino acid sequencing analysis to confirm the identity of the recombinantly 
expressed protein and subsequently used to raise antibodies. An Example describing in detail the 
generation of monoclonal and polyclonal antibodies appears in Example 7. 

The protein encoded by the identified nucleic acid sequence of interest or portion thereof can 

35 be purified using standard immunochromatography techniques. In such procedures, a solution 
containing the secreted protein, such as the culture medium or a cell extract, is applied to a column 
having antibodies against the secreted protein attached to the chromatography matrix. The secreted 
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protein is allowed to bind the immunochromatography column. Thereafter, the column is washed to 
remove non-specifically bound proteins. The specifically bound secreted protein is then released from 
the column and recovered using standard techniques. These procedures are well known in the art. 

In an alternative protein purification scheme, the identified nucleic acid sequence of interest or 
5 portion thereof can be incorporated into expression vectors designed for use in purification schemes 
employing chimeric polypeptides. In such strategies the coding sequence of the identified nucleic acid 
sequence of interest or portion thereof is inserted in-frame with the gene encoding the other half of the 
chimera. The other half of the chimera can be maltose binding protein (MBP) or a nickel binding 
polypeptide encoding sequence. A chromatography matrix having antibody to MBP or nickel attached 
10 thereto is then used to purify the chimeric protein. Protease cleavage sites can be engineered between 
the MBP gene or the nickel binding polypeptide and the identified expected gene of interest, or portion 
thereof. Thus, the two polypeptides of the chimera can be separated from one another by protease 
digestion. 

One useful expression vector for generating maltose binding protein fusion proteins is pMAL 
15 (New England Biolabs), which encodes the malE gene. In the pMal protein fusion system, the cloned 
gene is inserted into a pMal vector downstream from the malE gene. This results in the expression of 
an MBP-fusion protein. The fusion protein is purified by affinity chromatography. These techniques 
as described are well known to those skilled in the art of molecular biology. 

EXAMPLE 7 

20 Production of an Antibody to an isolated E. coli Protein 

Substantially pure protein or polypeptide is isolated from the transformed cells as described in 
Example 6. The concentration of protein in the final preparation is adjusted, for example, by 
concentration on a 10,000 molecular weight cut off" AMICON filter device (Millipore, Bedford, MA), 
to the level of a few micrograms/ml. Monoclonal or polyclonal antibody to the protein can then be 
25 prepared as follows: 

Monoclonal Antibody Production by Hybridoma Fusion 

Monoclonal antibody to epitopes of any of the peptides identified and isolated as described can 
be prepared from murine hybridomas according to the classical method of Kohler, G. and Milstein, C, 
Nature 256:495 (1975) or any of the well-known derivative methods thereof. Briefly, a mouse is 

30 repetitively inoculated with a few micrograms of the selected protein or peptides derived therefrom 
over a period of a few weeks. The mouse is then sacrificed, and the antibody producing cells of the 
spleen isolated. The spleen cells are fused by means of polyethylene glycol with mouse myeloma cells, 
and the excess unfused cells destroyed by growth of the system on selective media comprising 
aminopterin (HAT media). The successfully fused cells are diluted and aliquots of the dilution placed 

35 in wells of a microtiter plate where growth of the culture is continued. Antibody-producing clones are 
identified by detection of antibody in the supernatant fluid of the wells by immunoassay procedures, 
such as EUSA, as described by Engvall, E., "Enzyme immunoassay ELISA and EMIT/' Meth. 
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EnzymoL 70:419 (1980), and derivative methods thereof. Selected positive clones can be expanded 
and their monoclonal antibody product harvested for use. Detailed procedures for monoclonal antibody 
production are described in Davis, L. et al. Basic Methods in Molecular Biology Elsevier, New York. 
Section 21-2. 

5 Polyclonal Antibody Production by Immunization 

Polyclonal antiserum containing antibodies to heterogeneous epitopes of a single protein or a 
peptide can be prepared by immunizing suitable animals with the expressed protein or peptides derived 
therefrom described above, which can be unmodified or modified to enhance immunogenicity. 
Effective polyclonal antibody production is affected by many factors related both to the antigen and the 

10 host species. For example, small molecules tend to be less immunogenic than larger molecules and can 
require the use of carriers and adjuvant Also, host animals vary in response to site of inoculations and 
dose, with both inadequate or excessive doses of antigen resulting in low titer antisera. Small doses (ng 
level) of antigen administered at multiple intradermal sites appears to be most reliable. An effective 
immunization protocol for rabbits can be found in Vaitukaitis, J. et al. J. Clin. Endocrinol Metab. 

15 33:988-991 (1971). 

Booster injections can be given at regular intervals, and antiserum harvested when antibody 
titer thereof, as determined semi-quantitatively, for example, by double immunodiffiision in agar 
against known concentrations of the antigen, begins to fall. See, for example, Ouchterlony, O. et al., 
Chap. 19 in: Handbook of Experimental Immunology D. Wier (ed) Blackwell (1973). Plateau 

20 concentration of antibody is usually in the range of 0. 1 to 0.2 mg/ml of serum (about 1 2 jiM). Affinity 
of the antisera for the antigen is determined by preparing competitive binding curves, as described, for 
example, by Fisher, D., Chap. 42 in: Manual of Clinical Immunology, 2d Ed. (Rose and Friedman, 
Eds.) Amer. Soc. For Microbiol., Washington, D.C. (1980). 

Antibody preparations prepared according to either protocol are useful in quantitative 

25 immunoassays which determine concentrations of antigen-bearing substances in biological samples; 
they are also used semi-quantitatively or qualitatively to identify the presence of antigen in a biological 
sample. The antibodies can also be used in therapeutic compositions for killing bacterial cells 
expressing the protein. 

EXAMPLE 8 

30 Screening Chemical Libraries 

A. Protein-Based Assays 

Having isolated and expressed bacterial proteins shown to be required for bacterial 
proliferation, the present invention further contemplates the use of these expressed proteins in assays to 
screen libraries of compounds for potential drug candidates. The generation of chemical libraries is 

35 well known in the art For example combinatorial chemistry can be used to generate a library of 
compounds to be screened in the assays described herein. A combinatorial chemical library is a 
collection of diverse chemical compounds generated by either chemical synthesis or biological 

-48- 



WO 01/34810 



PC17US00/30950 



synthesis by combining a number of chemical "building blocks" reagents. For example, a linear 
combinatorial chemical library such as a polypeptide library is formed by combining amino acids in 
every possible combination to yield peptides of a given length. Millions of chemical compounds 
theoretically can be synthesized through such combinatorial mixings of chemical building blocks. For 
5 example, one commentator observed that the systematic, combinatorial mixing of 100 interchangeable 
chemical building blocks results in the theoretical synthesis of 100 million tetrameric compounds or 10 
billion pentameric compounds. (Gallop et ah, "Applications of Combinatorial Technologies to Drug 
Discovery, Background and Peptide Combinatorial Libraries," Journal of Medicinal Chemistry, Vol. 
37, No. 9, 1233-1250 (1994). Other chemical libraries known to those in the art may also be used, 

1 0 including natural product libraries. 

Once generated, combinatorial libraries can be screened for compounds that possess desirable 
biological properties. For example, compounds which may be useful as drugs or to develop drugs 
would likely have the ability to bind to the target protein identified, expressed and purified as discussed 
above. Further, if the identified target protein is an enzyme, candidate compounds would likely 

1 5 interfere with the enzymatic properties of the target protein. Any enzyme can be a target protein. For 
example, the enzymatic function of a target protein can be to serve as a protease, nuclease, 
phosphatase, dehydrogenase, transporter protein, transcriptional enzyme, and any other type of 
enzyme known or unknown. Thus, the present invention contemplates using the protein products 
described above to screen combinatorial and other chemical libraries. 

20 Those in the art will appreciate that a number of techniques exist for characterizing target 

proteins in order to identify molecules useful for the discovery and development of therapeutics. For 
example, some techniques involve the generation and use of small peptides to probe and analyze target 
proteins both biochemically and genetically in order to identify and develop drug leads. Such 
techniques include the methods described in PCT publications No. W09935494, W09819162, 

25 W09954728. 

In another example, the target protein is a serine protease and the substrate of the enzyme is 
known. The present example is directed towards the analysis of libraries of compounds to identify 
compounds that function as inhibitors of the target enzyme. First, a library of small molecules is 
generated using methods of combinatorial library formation well known in the art U.S. Patent NOs. 

30 5,463,564 and 5,574, 656, to Agrafiotis, et aL, entitled "System and Method of Automatically 
Generating Chemical Compound with Desired Properties," are two such teachings. Then the library 
compounds are screened to identify library compounds that possess desired structural and functional 
properties. U.S. Patent No. 5,684,71 1 also discusses a method for screening libraries. 

To illustrate the screening process, the combined target and chemical compounds of the library 

35 are exposed to and permitted to interact with the purified enzyme. A labeled substrate is added to the 
incubation. The label on the substrate is such that a detectable signal is emitted from metabolized 
substrate molecules. The emission of this signal permits one to measure the effect of the combinatorial 
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library compounds on the enzymatic activity of target enzymes. The characteristics of each library 
compound is encoded so that compounds demonstrating activity against the enzyme can be analyzed 
and features common to the various compounds identified can be isolated and combined into future 
iterations of libraries. 

S Once a library of compounds is screened, subsequent libraries are generated using those 

chemical building blocks that possess the features shown in the first round of screen to have activity 
against the target enzyme. Using this method, subsequent iterations of candidate compounds will 
possess more and more of those structural and functional features required to inhibit the function of the 
target enzyme, until a group of enzyme inhibitors with high specificity for the enzyme can be found. 
10 These compounds can then be further tested for their safety and efficacy as antibiotics for use in 
mammals. 

It will be readily appreciated that this particular screening methodology is exemplary only. 
Other methods are well known to those skilled in the art For example, a wide variety of screening 
techniques are known for a large number of naturally-occurring targets when the biochemical 
15 function of the target protein is known. 
B. Cell Based Assays 

Current cell-based assays used to identify or to characterize compounds for drug discovery 
and development frequently depend on detecting the ability of a test compound to inhibit the 
activity of a target molecule located within a cell or located on the surface of a cell. Most often 

20 such target molecules are proteins such as enzymes, receptors and the like. However, target 
molecules may also include other molecules such as DNAs, lipids, carbohydrates and RNAs 
including messenger RNAs, ribosomal RNAs, tRNAs and the like. A number of highly sensitive 
cell-based assay methods are available to those of skill in the art to detect binding and interaction of 
test compounds with specific target molecules. However, these methods are generally not highly 

25 effective when the test compound binds to or otherwise interacts with its target molecule with 
moderate or low affinity. In addition, the target molecule may not be readily accessible to a test 
compound in solution, such as when the target molecule is located inside the cell or within a 
cellular compartment such as the periplasm of a bacterial cell. Thus, current cell-based assay 
methods are limited in that they are not effective in identifying or characterizing compounds that 

30 interact with their targets with moderate to low affinity or compounds that interact with targets that 
are not readily accessible. 

Cell-based assay methods of the present invention have substantia] advantages over current 
cell-based assays practiced in the art. These advantages derive from the use of sensitized cells in 
which the level or activity of a proliferation-required gene product (the target molecule) has been 

35 specifically reduced to the point where the presence or absence of its function becomes a rate- 
determining step for cellular proliferation. Bacterial, fungal, plant, or animal cells can all be used 
with the present method. Such sensitized cells become much more sensitive to compounds that are 
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active against the affected target molecule. Thus, cell-based assays of the present invention are 
capable of detecting compounds exhibiting low or moderate potency against the target molecule of 
interest because such compounds are substantially more potent on sensitized cells than on non- 
sensitized cells. The affect may be such that a test compound may be two to several times more 

5 potent, at least 10 times more potent or even at least 100 times more potent when tested on the 
sensitized cells as compared to the non-sensitized cells. 

Due in part to the increased appearance of antibiotic resistance in pathogenic 
microorganisms and to the significant side-effects associated with some currently used antibiotics, 
novel antibiotics acting at new targets are highly sought after in the art. Yet, another limitation in 

10 the current art related to cell-based assays is the problem of identifying hits against the same kinds 
of target molecules in the same limited set of biological pathways over and over again. This may 
occur when compounds acting at such new targets are discarded, ignored or fail to be detected 
because compounds acting at the "old" targets are encountered more frequently and are more potent 
than compounds acting at the new targets. As a result, the majority of antibiotics in use currently 

15 interact with a relatively small number of target molecules within an even more limited set of 
biological pathways. 

The use of sensitized cells of the current invention provides a solution to the above problem 
in two ways. First, desired compounds acting at a target of interest, whether a new target or a 
previously known but poorly exploited target, can now be detected above the "noise" of compounds 

20 acting at the "old" targets due to the specific and substantial increase in potency of such desired 
compounds when tested on the sensitized cells of the current invention. Second, the methods used 
to sensitize cells to compounds acting at a target of interest may also sensitize these cells to 
compounds acting at other target molecules within the same biological pathway. For example, 
expression of an antisense molecule to a gene encoding a ribosomal protein is expected to sensitize 

25 the cell to compounds acting at that ribosomal protein and may also sensitize the cells to 
compounds acting at any of the ribosomal components (proteins or rRNA) or even to compounds 
acting at any target which is part of the protein synthesis pathway. Thus an important advantage of 
the present invention is the ability to reveal new targets and pathways that were previously not 
readily accessible to drug discovery methods. 

30 Sensitized cells of the present invention are prepared by reducing the activity or level of a 

target molecule. The target molecule may be a gene product, such as an RNA or polypeptide 
produced from the proliferation-required nucleic acids described herein. Alternatively, the target 
may be a gene product such as an RNA or polypeptide which is produced form a sequence within 
the same operon as the proliferation-required nucleic acids described herein. In addition, the target 

35 may be an RNA or polypeptide in the same biological pathway as the proliferation-required nucleic 
acids described herein. Such biological pathways include, but are not limited to, enzymatic, 
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biochemical and metabolic pathways as well as pathways involved in the production of cellular 
structures such the cell wall. 

Current methods employed in the arts of medicinal and combinatorial chemistries are able 
to make use of structure-activity relationship information derived from testing compounds in 
5 various biological assays including direct binding assays and cell-based assays. Occasionally 
compounds are directly identified in such assays that are sufficiently potent to be developed as 
drugs. More often, initial hit compounds exhibit moderate or low potency. Once a hit compound is 
identified with low or moderate potency, directed libraries of compounds are synthesized and tested 
in order to identify more potent leads. Generally these directed libraries are combinatorial chemical 

10 libraries consisting of compounds with structures related to the hit compound but containing 
systematic variations including additions, subtractions and substitutions of various structural 
features. When tested for activity against the target molecule, structural features are identified that 
either alone or in combination with other features enhance or reduce activity. This information is 
used to design subsequent directed libraries containing compounds with enhanced activity against 

15 the target molecule. After one or several iterations of this process, compounds with substantially 
increased activity against the target molecule are identified and may be further developed as drugs. 
This process is facilitated by use of the sensitized cells of the present invention since compounds 
acting at the selected targets exhibit increased potency in such cell-based assays, thus; more 
compounds can now be characterized providing more useful information than would be obtained 

20 otherwise. 

Thus, it is now possible using cell-based assays of the present invention to identify or 
characterize compounds that previously would not have been readily identified or characterized 
including compounds that act at targets that previously were not readily exploited using cell-based 
assays. The process of evolving potent drug leads from initial hit compounds is also substantially 

25 improved by the cell-based assays of the present invention because, for the same number of test 
compounds, more structure-function relationship information is likely to be revealed. 

The method of sensitizing a cell entails selecting a suitable gene or operon. A suitable gene 
or operon is one whose expression is required for the proliferation of the cell to be sensitized. The 
next step is to introduce into the cells to be sensitized, an antisense RNA capable of hybridizing to 

30 the suitable gene or operon or to the RNA encoded by the suitable gene or operon. Introduction of 
the antisense RNA can be in the form of an expression vector in which antisense RNA is produced 
under the control of an inducible promoter. The amount of antisense RNA produced is limited by 
varying the inducer concentration to which the cell is exposed and thereby varying the activity of 
the promoter driving transcription of the antisense RNA. Thus, cells are sensitized by exposing 

35 them to an inducer concentration that results in a sub-lethal level of antisense RNA expression. 

In one embodiment of the cell-based assays, the identified exogenous E. coli nucleotide 
sequences of the present invention are used to inhibit the production of a proliferation-required 
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protein. Expression vectors producing antisense RNA against identified genes required for 
proliferation are used to limit the concentration of a proliferation-required protein without severly 
inhibiting growth. To achieve that goal, a growth inhibition dose curve of inducer is calculated by 
plotting various doses of inducer against the corresponding growth inhibition caused by the 
5 antisense expression. From this curve, various percentages of antisense induced growth inhibition, 
from 1 to 100% can be determined. If the promoter contained in the expression vector contains a 
lac operator the transcription is regulated by lac repressor and expression from the promoer is 
inducible with IPTG. For example, the highest concentration of the inducer IPTG that does not 
reduce the growth rate (0% growth inhibition) can be predicted from the curve. Cellular 

10 proliferation can be monitored by growth medium turbidity via OD measurements. In another 
example, the concentration of inducer that reduces growth by 25% can be predicted from the curve. 
In still another example, a concentration of inducer that reduces growth by 50% can be calculated. 
Additional parameters such as colony forming units (cfu) can be used to measure cellular viability. 
Cells to be assayed are exposed to the above-determined concentrations of inducer. The 

15 presence of the inducer at this sub-lethal concentration reduces the amount of the proliferation 
required gene product to a low amount in the cell that will limit but not prevent growth. Cells 
grown in the presence of this concentration of inducer are therefore specifically more sensitive to 
inhibitors of the proliferation-required protein or RNA of interest or to inhibitors of proteins or 
RNAs in the same biological pathway as the proliferation-required protein or RNA of interest but 

20 not to inhibitors of unrelated proteins or RNAs. 

Cells pretreated with sub-inhibitory concentrations of inducer and thus containing a reduced 
amount of proliferation-required target gene product are then used to screen for compounds that 
reduce cell growth. The sub-lethal concentration of inducer may be any concentration consistent 
with the intended use of the assay to identify candidate compounds to which the cells are more 

25 sensitive. For example, the sub- lethal concentration of the inducer may be such that growth 
inhibition is at least about 5%, at least about 8%, at least about 10%, at least about 20%, at least 
about 30%, at least about 40%, at least about 50%, at least about 60% at least about 75%, 90%, 
95% or more. Cells which are pre-sensitized using the preceding method are more sensitive to 
inhibitors of the target protein because these cells contain less target protein to be inhibited than do 

30 wild-type cells. 

In another embodiment of the cell based assays of the present invention, the level or 
activity of a proliferation required gene product is reduced using a mutation, such as a temperature 
sensitive mutation, in the proliferation-required sequence and an antisense nucleic acid against the 
proliferation-required sequence. Growing the cells at an intermediate temperature between the 
35 permissive and restrictive temperatures of the temperature sensitive mutant where the mutation is in 
a proliferation-required gene produces cells with reduced activity of the proliferation-required gene 
product. The antisense RNA directed against the proliferation-required sequence further reduces 
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the activity of the proliferation required gene product Drugs that may not have been found using 
either the temperature sensitive mutation or the antisense nucleic acid alone may be identified by 
determining whether cells in which expression of the antisense nucleic acid has been induced and 
which are grown at a temperature between the permissive temperature and the restrictive 
5 temperature are substantially more sensitive to a test compound than cells in which expression of 
the antisense nucleic acid has not been induced and which are grown at a permissive temperature. 
Also drugs found previously from either the antisense nucleic acid alone or the temperature 
sensitive mutation alone may have a different sensitivity profile when used in cells combining the 
two approaches, and that sensitivity profile may indicate a more specific action of the drug in 

1 0 inhibiting one or more activities of the gene product. 

Temperature sensitive mutations may be located at different sites within the gene and 
correspond to different domains of the protein. For example, the dnaB gene of Escherichia coli 
encodes the replication fork DNA helicase. DnaB has several domains, including domains for 
oligomerization, ATP hydrolysis, DNA binding, interaction with primase, interaction with DnaC, 

IS and interaction with DnaA [(Biswas, E.E. and Biswas, S.B. 1999. Mechanism and DnaB helicase of 
Escherichia coli: structural domains involved in ATP hydrolysis, DNA binding, and 
oligomerization. Biochem. 38:10919-10928; Hiasa, H. and Marians, K.J. 1999. Initiation of 
bidirectional replication at the chromosomal origin is directed by the interaction between helicase 
and primase. J. Biol. Chem. 274:27244-27248; San Martin, C, Radermacher, M., Wolpensinger, 

20 B., Engel, A., Miles, C.S., Dixon, N.E., and Carazo, J.M. 1998. Three-dimensional reconstructions 
from cryoelectron microscopy images reveal an intimate complex between helicase DnaB and its 
loading partner DnaC. Structure 6:501-9; Sutton, M.D., Carr, K.M., Vicente, M., and Kaguni, J.M. 
1998. Escherichia coli DnaA protein. The N-terminal domain and loading of DnaB helicase at the 
E. coli chromosomal. J. Biol. Chem. 273:34255-62.)]. Temperature sensitive mutations in different 

25 domains of DnaB confer different phenotypes at the restrictive temperature, which include either an 
abrupt stop or slow stop in DNA replication with or without DNA breakdown (Wechsler, JA. and 
Gross, J.D. 1971. Escherichia coli mutants temperature-sensitive for DNA synthesis. Mol. Gen. 
Genetics 113:273-284) and termination of growth or cell death. Combining the use of temperature 
sensitive mutations in the dnaB gene that cause cell death at the restrictive temperature with an 

30 antisense to the dnaB gene could lead to the discovery of very specific and effective inhibitors of 
one or a subset of activities exhibited by DnaB. 

When screening for antimicrobial agents against a gene product required for proliferation, 
growth inhibition of cells containing a limiting amount of that proliferation-required gene product 
can be assayed. Growth inhibition can be measured by directly comparing the amount of growth, 

35 measured by the optical density of the growth medium, between an experimental sample and a 
control sample. Alternative methods for assaying cell proliferation include measuring green 



.-54- 



WO 01/34810 



PCT/US00/30950 



fluorescent protein (GFP) reporter construct emissions, various enzymatic activity assays, and other 
methods well known in the art 

It will be appreciated that the above method may be performed in solid phase, liquid phase 
or a combination of the two. For example, cells grown on nutrient agar containing the inducer of the 
5 antisense construct may be exposed to compounds spotted onto the agar surface. A compound's 
effect may be judged from the diameter of the resulting killing zone, the area around the compound 
application point in which cells do not grow. Multiple compounds may be transferred to agar plates 
and simultaneously tested using automated and semi-automated equipment including but not 
restricted to multi-channel pipettes (for example the Beckman Multimek) and multi-channel 
10 spotters (for example the Genomic Solutions Flexys). In this way multiple plates and thousands to 
millions of compounds may be tested per day. 

The compounds may also be tested entirely in liquid phase using microtiter plates as 
described below. Liquid phase screening may be performed in microtiter plates containing 96, 384, 
1536 or more wells per microtiter plate to screen multiple plates and thousands to millions of 
15 compounds per day. Automated and semi-automated equipment may be used for addition of 
reagents (for example cells and compounds) and determination of cell density. 

EXAMPLE 9 

Cell Based Assay Using Antisense Complementary to Genes Encoding Ribosomal Proteins 
The effectiveness of the above cell based assay was validated using constructs expressing 
20 antisense RNA to the proliferation required E. coli genes rplL 9 rpU 9 and rplW encoding ribosomal 
proteins L7/L12, L10 and L23 respectively. These proteins are part of the protein synthesis 
apparatus of the cell and as such are required for proliferation. These constructs were used to test 
the effect of antisense expression on cell sensitivity to antibiotics known to bind to the ribosome 
and thereby inhibit protein synthesis. Constructs expressing antisense RNA to several other genes 
25 (elaD 9 visC, yohH, and atpE/B), the products of which are not involved in protein synthesis were 
used for comparison. 

First pLex5BA (Krause et al., J. Mol. Biol. 274: 365 (1997)) expression vectors containing 
antisense constructs to either rplWox to elaD were introduced into separate E. coli cell populations. 
Vector introduction is a technique well known to those of ordinary skill in the art. The expression 

30 vectors of this example contain IPTG inducible promoters that drive the expression of the antisense 
RNA in the presence of the inducer. However, those skilled in the art will appreciate that other 
inducible promoters may also be used. Suitable expression vectors are also well known in the art. 
The £ coli antisense clones encoding ribosomal proteins L7/L12, L10 and L23 were used to test the 
effect of antisense expression on cell sensitivity to the antibiotics known to bind to these proteins. 

35 First, expression vectors containing antisense to either the genes encoding L7/L12 and L10 or L23 
were introduced into separate E. coli cell populations. 
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The cell populations were exposed to a range of IPTG concentrations in liquid medium to 
obtain the growth inhibitory dose curve for each clone (Fig. 1). First, seed cultures were grown to a 
particular turbidity that is measured by the optical density (OD) of the growth solution. The OD of 
the solution is directly related to the number of bacterial cells contained therein. Subsequently, 
5 sixteen 200 ul liquid medium cultures were grown in a 96 well microtiter plate at 37 C with a range 
of IPTG concentrations in duplicate two-fold serial dilutions from 1600 uM to 12.5 uM (final 
concentration). Additionally, control cells were grown in duplicate without IPTG. These cultures 
were started from equal amounts of cells derived from the same initial seed culture of a clone of 
interest. The cells were grown for up to 15 hours and the extent of growth was determined by 

10 measuring the optical density of the cultures at 600 nm. When the control culture reached mid-log 
phase the percent growth of the control for each of the IPTG containing cultures was plotted against 
the log concentrations of IPTG to produce a growth inhibitory dose response curve for the IPTG. 
The concentration of IPTG that inhibits cell growth to 50% (IC 5 o) as compared to the 0 mM IPTG 
control (0% growth inhibition) was then calculated from the curve. Under these conditions, an 

15 amount of antisense RNA was produced that reduced the expression levels of rplW and elaD to a 
degree such that growth was inhibited by 50%. 

Alternative methods of measuring growth are also contemplated. Examples of these 
methods include measurements of proteins, the expression of which is engineered into the cells 
being tested and can readily be measured. Examples of such proteins include green fluorescent 

20 protein (GFP) and various enzymes. 

Cells were pretreated with the selected concentration of IPTG and then used to test the 
sensitivity of cell populations to tetracycline, erythromycin and other protein synthesis inhibitors. 
Figure 2 is an IPTG dose response curve in E. coli transformed with an IPTG-inducible plasmid 
containing either an antisense clone to the £. coli ribosomal protein rp/W (AS-rplW) which is 

25 required for protein synthesis and essential for cell proliferation, or an antisense clone to the elaD 
(AS-elaD) gene which is not known to be involved in protein synthesis and which is also essential 
for proliferation. An example of a tetracycline dose response curve is 

shown in Figures 2A and 2B for the rplW and elaD genes, respectively. Cells were grown to log 
phase and then diluted into media alone or media containing IPTG at concentrations which give 

30 20% and 50% growth inhibition as determined by IPTG dose response curves. After 2.5 hours, the 
cells were diluted to a final OD600 of 0.002 into 96 well plates containing (1) +/- IPTG at the same 
concentrations used for the 2.5 hour preincubation; and (2) serial two-fold dilutions of tetracycline 
such that the final concentrations of tetracycline range from 1 ng/ml to 15.6 ng/ml and 0 ng/ml. 
The 96 well plates were incubated at 37°C and the OD600 was read by a plate reader every 5 

35 minutes for up to 15 hours. For each IPTG concentration and the no IPTG control, tetracycline 
dose response curves were determined when the control (absence of tetracycline) reached 0.1 
OD600. To compare tetracycline sensitivity with and without IPTG, tetracycline IC 5 os were 
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determined from the dose response curves (Figs. 3A-B). Cells with reduced levels of L23 (rplW) 
showed increased sensitivity to tetracycline (Fig. 2A) as compared to cells with reduced levels of 
elaD (Fig. 2B). Figure 3 shows a summary bar chart in which the ratios of tetracycline IC 50s 
determined in the presence of IPTG which gives 50% growth inhibition versus tetracycline IC 5 os 
5 determined without IPTG (fold increase in tetracycline sensitivity) were plotted. Cells with reduced 
levels of either L7/L12 (genes rplL, rpU) or L23 (rplW) showed increased sensitivity to tetracycline 
(Fig. 3). Cells expressing antisense to genes not known to be involved in protein synthesis (atpB/E 9 
raC, elaD, yohH) did not show the same increased sensitivity to tetracycline, validating the 
specificity of this assay (Fig. 3). 

10 In addition to the above, it has been observed in initial experiments that clones expressing 

antisense RNA to genes involved in protein synthesis (including genes encoding ribosomal proteins 
L7/L12 & L10, L7/L12 alone, L22, and LI 8, as well as genes encoding rRNA and Elongation 
Factor G) have increased sensitivity to the macrolide, erythromycin, whereas clones expressing 
antisense to the non-protein synthesis genes elaD, atpB/E and visC do not. Furthermore, the clone 

15 expressing antisense to rplL and rpU does not show increased sensitivity to nalidixic acid and 
ofloxacin, antibiotics which do not inhibit protein synthesis. 

The results with the ribosomal protein genes rplL, rpU 9 and rplW as well as the initial 
results using various other antisense clones and antibiotics show that limiting the concentration of 
an antibiotic target makes cells more sensitive to the antimicrobial agents that specifically interact 

20 with that protein. The results also show that these cells are sensitized to antimicrobial agents that 
inhibit the overall function in which the protein target is involved but are not sensitized to 
antimicrobial agents that inhibit other functions. 

The cell based assay described above may also be used to identify the biological pathway in 
which a proliferation-required nucleic acid or its gene product lies. In such methods, cells 

25 expressing a sub-lethal level of antisense to a target proliferation-required nucleic acid and control 
cells in which expression of the antisense has not been induced are contacted with a panel of 
antibiotics known to act in various pathways. If the antibiotic acts in the pathway in which the 
target proliferation-required nucleic acid or its gene product lies, cells in which expression of the 
antisense has been induced will be more sensitive to the antibiotic than cells in which expression of 

30 the antisense has not been induced. 

As a control, the results of the assay may be confirmed by contacting a panel of cells 
expressing antisense nucleic acids to many different proliferation-required genes including the 
target proliferation-required gene. If the antibiotic is acting specifically, heightened sensitivity to 
the antibiotic will be observed only in the cells expressing antisense to a target proliferation- 

35 required gene (or cells expressing antisense to other proliferation-required genes in the same 
pathway as the target proliferation-required gene) but will not be observed generally in all cells 
expressing antisense to proliferation-required genes. 
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Similarly, the above method may be used to determine the pathway on which a test 
compound, such as a test antibiotic acts. A panel of cells, each of which expresses antisense to a 
proliferation-required nucleic acid in a known pathway, is contacted with a compound for which it 
is desired to determine the pathway on which it acts. The sensitivity of the panel of cells to the test 
5 compound is determined in cells in which expression of the antisense has been induced and in 
control cells in which expression of the antisense has not been induced. If the test compound acts 
on the pathway on which an antisense nucleic acid acts, cells in which expression of the antisense 
has been induced will be more sensitive to the compound than cells in which expression of the 
antisense has not been induced. In addition, control cells in which expression of antisense to 
10 proliferation-required genes in other pathways has been induced will not exhibit heightened 
sensitivity to the compound. In this way, the pathway on which the test compound acts may be 
determined. 

The Example below provides one method for performing such assays. 

EXAMPLE 10 

15 Identification of thePathway in which a Proliferation-Required 

Gene Lies or the Pathway on which an Antibiotic Acts 

A. Preparation of Bacterial Stocks for Assay 

To provide a consistent source of cells to screen, frozen stocks of host bacteria containing 
the desired antisense construct are prepared using standard microbiological techniques. For 

20 example, a single clone of the organism can be isolated by streaking out a sample of the original 
stock onto an agar plate containing nutrients for cell growth and an antibiotic for which the 
antisense construct contains a gene which confers resistance. After overnight growth an isolated 
colony is picked from the plate with a sterile needle and transferred to an appropriate liquid growth 
media containing the antibiotic required for maintenance of the plasmid. The cells are incubated at 

25 30°C to 37°C with vigorous shaking for 4 to 6 hours to yield a culture in exponential growth. Sterile 
glycerol is added to 15% (volume to volume) and 100|aL to 500 jiL aliquots are distributed into 
sterile cryotubes, snap frozen in liquid nitrogen, and stored at-80°C for future assays. 

B. Growth of Bacteria for Use in the Assay 

A day prior to an assay, a stock vial is removed from the freezer, rapidly thawed (37°C 
30 water bath) and a loop of culture is streaked out on an agar plate containing nutrients for cell growth 
and an antibiotic to which the antisense construct confers resistance. After overnight growth at 
37°C, ten randomly chosen, isolated colonies are transferred from the plate (sterile inoculum loop) 
to a sterile tube containing 5 mL of LB medium containing the antibiotic to which the antisense 
vector confers resistance. After vigorous mixing to form a homogeneous cell suspension, the optical 
35 density of the suspension is measured at 600 nm (OD600) and if necessary an aliquot of the 
suspension is diluted into a second tube of 5 mL, sterile, LB medium plus antibiotic to achieve an 
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OD600 £ 0.02 absorbance units. The culture is then incubated at 37° C for 1-2 hrs with shaking 
until the OD600 reaches OD 0.2 - 0.3. At this point the cells are ready to be used in the assay. 

5 C. Selection of Media to be Used in Assay 

Two fold dilution series of the inducer are generated in culture media containing the 
appropriate antibiotic for maintenance of the antisense construct. Several media are tested side by 
side and three to four wells are used to evaluate the effects of the inducer at each concentration in 
each media. For example, M9 minimal media, LB broth, TBD broth and Muller-Hinton media may 

1 0 be tested with the inducer IPTG at the following concentrations, 50 ^M, 100 nM, 200 uM, 400 \\M, 
600 jiM, 800 fiM and 1000 jjM. Equal volumes of test media- inducer and cells are added to the 
wells of a 384 well microtiter plate and mixed. The cells are prepared as described above and 
diluted 1: 100 in the appropriate media containing the test antibiotic immediately prior to addition to 
the microtiter plate wells. For a control, cells are also added to several wells of each media that do 

15 not contain inducer, for example 0 uM IPTG. Cell growth is monitored continuously by incubation 
at 37°C in a microtiter plate reader monitoring the OD600 of the wells over an 1 8-hour period. The 
percent inhibition of growth produced by each concentration of inducer is calculated by comparing 
the rates of logarithmic growth against that exhibited by cells growing in media without inducer. 
The medium yielding greatest sensitivity to inducer is selected for use in the assays described 

20 below. 

D. Measurement of Test Antibiotic Sensitivity in the Absence of Antisense Construct Induction 

Two-fold dilution series of antibiotics of known mechanism of action are generated in the 
culture media selected for further assay development that has been supplemented with the antibiotic 
used to maintain the construct. A panel of test antibiotics known to act on different pathways is 

25 tested side by side with three to four wells being used to evaluate the effect of a test antibiotic on 
cell growth at each concentration. Equal volumes of test antibiotic and cells are added to the wells 
of a 384 well microtiter plate and mixed. Cells are prepared as described above using the media 
selected for assay development supplemented with the antibiotic required to maintain the antisense 
construct and are diluted 1:100 in identical media immediately prior to addition to the microtiter 

30 plate wells. For a control, cells are also added to several wells that contain the solvent used to 
dissolve the antibiotics but no antibiotic. Cell growth is monitored continuously by incubation at 
37°C in a microtiter plate reader monitoring the OD600 of the wells over an 18-hour period. The 
percent inhibition of growth produced by each concentration of antibiotic is calculated by 
comparing the rates of logarithmic growth against that exhibited by cells growing in media without 

35 antibiotic. A plot of percent inhibition against logfantibiotic concentration] allows extrapolation of 
an ICso value for each antibiotic. 
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E. Measurement of Test Antibiotic Sensitivity in the Presence of Antisense Construct Inducer 

The culture media selected for use in the assay is supplemented with inducer at 
concentrations shown to inhibit cell growth by SO and 80% as described above and the antibiotic 
used to maintain the construct Two fold dilution series of the panel of test antibiotics used above 
5 are generated in each of these media. Several antibiotics are tested side by side with three to four 
wells being used to evaluate the effects of an antibiotic on cell growth at each concentration, in each 
media. Equal volumes of test antibiotic and cells are added to the wells of a 384 well microtiter 
plate and mixed. Cells are prepared as described above using the media selected for use in the 
assay supplemented with the antibiotic required to maintain the antisense construct The cells are 

10 diluted 1:100 into two 50 mL aliquots of identical media containing concentrations of inducer that 
have been shown to inhibit cell growth by 50% and 80 % respectively and incubated at 37°C with 
shaking for 2.5 hours. Immediately prior to addition to the microtiter plate wells, the cultures are 
adjusted to an appropriate ODaoo (typically 0.002) by dilution into warm (37°C) sterile media 
supplemented with identical concentrations of the inducer and antibiotic used to maintain the 

15 antisense construct. For a control, cells are also added to several wells that contain solvent used to 
dissolve test antibiotics but which contain no antibiotic. Cell growth is monitored continuously by 
incubation at 37°C in a microtiter plate reader monitoring the OD600 of the wells over an 18-hour 
period. The percent inhibition of growth produced by each concentration of antibiotic is calculated 
by comparing the rates of logarithmic growth against that exhibited by cells growing in media 

20 without antibiotic. A plot of percent inhibition against logfantibiotic concentration] allows 
extrapolation of an IC50 value for each antibiotic. 

F. Determining the Specificity of the Test Antibiotics 

A comparison of the IQoS generated by antibiotics of known mechanism of action under 
antisense induced and non-induced conditions allows the pathway in which a proliferation-required 
25 nucleic acid "lies to be identified. If cells expressing an antisense nucleic acid against a 
proliferation-required gene are selectively sensitive to an antibiotic acting via a particular pathway, 
then the gene against which the antisense acts is involved in the pathway in which the antibiotic 
acts. 

G. Identification of Pathway in which a Test Antibiotic Acts 

30 As discussed above, the cell based assay may also be used to determine the pathway against 

which a test antibiotic acts. In such an analysis, the pathways against which each member of a 
panel of antisense nucleic acids acts are identified as described above. A panel of cells, each 
containing an inducible antisense vector against a gene in a known proliferation-required pathway, 
is contacted with a test antibiotic for which it is desired to determine the pathway on which it acts 

35 under inducing an non-inducing conditions. If heightened sensitivity is observed in induced cells 
expressing antisense against a gene in a particular pathway but not in induced cells expressing 
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antisense against genes in other pathways, then the test antibiotic acts against the pathway for which 
heightened sensitivity was observed. 

One skilled in the ait will appreciate that further optimization of the assay conditions, such 
as the concentration of inducer used to induce antisense expression and/or the growth conditions 
5 used for the assay (for example incubation temperature and media components) may further 
increase the selectivity and/or magnitude of the antibiotic sensitization exhibited. 

The following example confirms the effectiveness of the methods described above. 

EXAMPLE 11 

Identification of the Pathway in which a Proliferation-Required Gene Lies 

10 Antibiotics of various chemical classes and modes of action were purchased from Sigma 

Chemicals (St. Louis, MO). Stock solutions were prepared by dissolving each antibiotic in an 
appropriate aqueous solution based on information provided by the manufacturer. The final 
working solution of each antibiotic contained no more than 0.2% (w/v) of any organic solvent To 
determine their potency against a bacterial strain engineered for expression of an antisense against a 

15 proliferation-required gene encoding 50S ribosomal protein, each antibiotic was serially diluted two 
or three fold in growth medium supplemented with the appropriate antibiotic for maintenance of the 
anti-sense construct At least ten dilutions were prepared for each antibiotic. 25 \iL aliquots of each 
dilution were transferred to discrete wells of a 384-well microplate (the assay plate) using a multi- 
channel pipette. Quadruplicate wells were used for each dilution of an antibiotic under each 

20 treatment condition (plus and minus inducer). Each assay plate contained twenty wells for cell 
growth controls (growth media replacing antibiotic), ten wells for each treatment (plus and minus 
inducer, in this example IPTG). Assay plates were usually divided into the two treatments: half the 
plate containing induced cells and an appropriate concentrations of inducer (in this example IPTG) 
to maintain the state of induction, the other half containing non-induced cells in the absence of 

25 IPTG. 

Cells for the assay were prepared as follows. Bacterial cells containing a construct, from 
which expression of antisense nucleic acid against rplL and rplJ, which encode proliferation- 
required SOS ribosomal subunit proteins, is inducible in the presence of IPTG, were grown into 
exponential growth (OD600 0.2 to 0.3) and then diluted 1: 100 into fresh media containing either 400 

30 pM or 0 \sM inducer (IPTG). These cultures were incubated at 37° C for 2.5 hr. After a 2.5 hr 
incubation, induced and non-induced cells were respectively diluted into an assay medium at a final 
OD(K)o value of 0.0004. The medium contained an appropriate concentration of the antibiotic for the 
maintenance of the anti-sense construct. In addition, the medium used to dilute induced cells was 
supplemented with 800 pM IPTG so that addition to the assay plate would result in a final IPTG 

35 concentration of 400 jiM. Induced and non-induced cell suspensions were dispensed (25 ^il/well) 
into the appropriate wells of the assay plate as discussed previously. The plate was then loaded into 
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a plate reader, incubated at constant temperature, and cell growth was monitored in each well by the 
measurement of light scattering at 595 nm. Growth was monitored every 5 minutes until the cell 
culture attained a stationary growth phase. For each concentration of antibiotic, a percentage 
inhibition of growth was calculated at the time point corresponding to mid-exponential growth for 

5 the associated control wells (no antibiotic, plus or minus IPTG). For each antibiotic and condition 
(plus or minus IPTG), a plot of percent inhibition versus log of antibiotic concentration was 
generated and the IC50 determined. A comparison of the !C 5 o for each antibiotic in the presence 
and absence of IPTG revealed whether induction of the antisense construct sensitized the cell to the 
mechanism of action exhibited by the antibiotic. Cells which exhibited a significant (standard 

10 statistical analysis) numerical decrease in the IC 5 o value in the presence of inducer were considered 
to have an increased sensitivity to the test antibiotic. 

The results are provided in the table below, which lists the classes and names of the 
antibiotics used in the analysis, the targets of the antibiotics, the IC50 in the absence of IPTG, the 
IC50 in the presence of IPTG, the concentration units for the IC50s, the fold increase in IC50 in the 

15 presence of IPTG, and whether increased sensitivity was observed in the presence of IPTG. 
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Sensitivity 
Increased? 


o o o o o o o 
2 2 2 2 2 2 2 


Fold Increase 
in Sensitivity 




Cone 
Unit 


ng/ml 
ng/ml 

ng/ml 
ng/ml 

mg/ml 
ng/ml 
ng/ml 


IC50 (+IPTG) 


2484 
24.16 

6025 
45.89 

4677 

181.97 

72550 


8 

flu 

o 

v> 

y 


2782 

6973 
49.61 

4077 
128.9 
145400 


TARGET 


Cell wall biosynthesis 
Cell wall biosynthesis 

DNA Gyrase activity 
DNA Gyrase activity 

Cell membrane function 
Dihydrofolate Reductase activity 
Cell wall biosynthesis 


ANTIBIOTIC CLASS /Names 


OTHER ANTIBIOTIC MECHANISMS 
B-LACTAMS 

Cefoxitin 

Cefotaxime 
DNA SYNTHESIS INHIBITORS 

Nalidixic acid 

Ofloxacin 
OTHER 

Bacitracin 

Trimethoprim 

Vancomycin 
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The above results demonstrate that induction of an antisense RNA to genes encoding SOS 
ribosomal subunit proteins results in a selective and highly significant sensitization of cells to 
antibiotics that inhibit ribosomal function and protein synthesis. The above results further 
5 demonstrate that induction of an antisense construct to an essential gene sensitizes an organism to 
compounds that interfere with that gene products* biological role. This sensitization is restricted to 
compounds that interfere with pathways associated with the targeted gene and it's product. 

Assays utilizing antisense constructs to essential genes can be used to identify compounds 
that specifically interfere with the activity of multiple targets in a pathway. Such constructs can be 
10 used to simultaneously screen a sample against multiple targets in one pathway in one reaction 
(Combinatorial HTS). 

Furthermore, as discussed above, panels of antisense construct containing cells may be used 
to characterize the point of intervention of any compound affecting an essential biological pathway 
including antibiotics with no known mechanism of action. 

15 Another embodiment of the present invention is a method for determining the pathway 

against which a test antibiotic compound is active in which the activity of target proteins or nucleic 
acids involved in proliferation-required pathways is reduced by contacting cells with a sublethal 
concentration of a known antibiotic which acts against the target protein or nucleic acid. In one 
embodiment, the target protein or nucleic acid is a target protein or nucleic acid corresponding to a 

20 proliferation-required nucleic acid identified using the methods described above. The method is 
similar to those described above for determining which pathway a test antibiotic acts against except 
that rather than reducing the activity or level of a proliferation-required gene product using a 
sublethal level of antisense to a proliferation-required nucleic acid, the activity or level of the 
proliferation-required gene product is reduced using sublethal level of a known antibiotic which 

25 acts against the proliferation required gene product 

Interactions between drugs which affect the same biological pathway has been described in 
the literature. For example, Mecillinam (Amdinocillin) binds to and inactivates the penicillin 
binding protein 2 (PBP2, product of the mrdA in E. coif). This antibiotic inteacts with other 
antibiotics that inhibit PBP2 as well as antibiotics that inhibit other penicillin binding proteins such 

30 as PBP3 [(Gutmann, L., Vincent, S., Billot-Klein, D., Acar, J.F., Mrena, E., and Williamson, R. 
(1986) Involvement of penicillin-binding protein 2 with other penicillin-binding proteins in lysis of 
Escherichia coli by some beta-lactam antibiotics alone and in synergistic lytic effect of 
amdinocillin (mecillinam). Antimicrobial Agents & Chemotherapy, 30:906-912)]. Interactions 
between drugs could, therefore, involve two drugs that inhibit the same target protein or nucleic 

35 acid or inhibit different proteins or nucleic acids in the same pathway [(Fukuoka, T., Domon, H., 
Kakuta, M, Ishii, C, Hirasawa, A., Utsui, Y., Ohya, S., and Yasuda, H. (1997) Combination effect 
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between panipenem and vancomycin on highly methicillin-resistant Staphylococcus aureus. Japan. 
J. Antibio. 50:41 1-419; Smith, C.E., Foleno, B.E., Barrett, J.F., and Frosc, M.B. (1997) Assessment 
of the synergistic interactions of levofloxacin and ampicillin against Enterococcus faecium by the 
checkerboard agar dilution and time-kill methods. Diagnos. Microbiol. Infect. Disease 27:85-92; 
5 den Hollander, J.G., Horrevorts, A.M., van Goor, M.L., Verbrugh, H.A., and Mouton, J.W. (1997) 
Synergism between tobramycin and ceftazidime against a resistant Pseudomonas aeruginosa strain, 
tested in an in vitro pharmacokinetic model. Antimicrobial Agents & Chemotherapy. 41:95-1 10)]. 

Two drugs may interact even though they inhibit different targets. For example, the proton 
pump inhibitor, Omeprazole, and the antibiotic, Amoxycillin, two synergistic compounds acting 

10 together, can cure Helicobacter pylori infection [( Gabryelewicz, A., Laszewicz, W., 
Dzieniszewski, J., Ciok, J., Marlicz, K., Bielecki, D., Popiela, T., Legutko, J., Knapik, Z., 
Poniewierka, E. (1997) Multicenter evaluation of dual-therapy (omeprazol and amoxycillin) for 
Helicobacter pylori- associated duodenal and gastric ulcer (two years of the observation). J. Physiol. 
Pharmacol. 48 Suppl 4:93-105)]. 

15 The growth inhibition from the sublethal concentration of the known antibiotic may be at 

least about 5%, at least about 8%, at least about 10%, at least about 20%, at least about 30%, at 
least about 40%, at least about 50%, at least about 60%, or at least about 75%, or more. 

Alternatively, the sublethal concentration of the known antibiotic may be determined by 
measuring the activity of the target proliferation-required gene product rather than by measuring 

20 growth inhibition. 

Cells are contacted with a combination of each member of a panel of known antibiotics at a 
sublethal level and varying concentrations of the test antibiotic. As a control, the cells are contacted 
with varying concentrations of the test antibiotic alone. The IC50 of the test antibiotic in the 
presence and absence of the known antibiotic is determined. If the IC50s in the presence and 

25 absence of the known drug are substantially similar, then the test drug and the known drug act on 
different pathways. If the IC50S are substantially different, then the test drug and the known drug 
act on the same pathway. 

Another embodiment of the present invention is a method for identifying a candidate 
compound for use as an antibiotic in which the activity of target proteins or nucleic acids involved 

30 in proliferation-required pathways is reduced by contacting cells with a sublethal concentration of a 
known antibiotic which acts against the target protein or nucleic acid. In one embodiment, the 
target protein or nucleic acid is a target protein or nucleic acid corresponding to a proliferation- 
required nucleic acid identified using the methods described above. The method is similar to those 
described above for identifying candidate compounds for use as antibiotics except that rather than 

35 reducing the activity or level of a proliferation-required gene product using a sublethal level of 
antisense to a proliferation-required nucleic acid, the activity or level of the proliferation-required 
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gene product is reduced using a sublethal level of a known antibiotic which acts against the 
proliferation required gene product. 

The growth inhibition from the sublethal concentration of the known antibiotic may be at 
least about 5%, at least about 8%, at least about 10%, at least about 20%, at least about 30%, at 
least about 40%, at least about 50%, at least about 60%, or at least about 75%, or more. 

Alternatively, the sublethal concentration of the known antibiotic may be determined by 
measuring the activity of the target proliferation-required gene product rather than by measuring 
growth inhibition. 

In order to characterize test compounds of interest, cells are contacted with a panel of 
known antibiotics at a sublethal level and one or more concentrations of the test compound. As a 
control, the cells are contacted with the same concentrations of the test compound alone. The IC 5 o 
of the test compound in the presence and absence of the known antibiotic is determined. If the IC 50 
of the test compound is substantially different in the presence and absence of the known drug then 
the test compound is a good candidate for use as an antibiotic. As discussed above, once a 
candidate compound is identified using the above methods its structure may be optimized using 
standard techniques such as combinatorial chemistry. 

Representative known antibiotics which may be used in each of the above methods are 
provided in the table below. However, it will be appreciated that other antibiotics may also be used. 



ANTIBIOTIC 


INHIBITS/TARGET 


RESISTANT 
MUTANTS 


Inhibitors of Transcription 






Rifamycin, 1959 Rifampicin 


Inhibits initiation of transcription/B-subunit 


rpoB, crp f cyaA 


Rifabutin Rifaximin 


RNA polymerase, rpoB 




Streptolydigin 


Accelerates transcription chain 


rpoB 




termination/B-subunit RNA polymerase 




Streptovaricin 


an acyclic ansamycin, inhibits RNA 


rpoB 




polymerase 




Actinomycin D+EDTA 


Intercalates between 2 successive G-C 


pldA 




pairs, rpoB, inhibits RNA synthesis 





Inhibitors of Nucleic Acid Metabolism 

Quinolones, 1962 Nalidixic a subunit gyrase and/or topoisomerase IV, 
acid Oxolinic acid gyrA 



Fluoroquinolones 
Ciprofloxacin, 1983 
Norfloxacin 
Coumerins Novobiocin 



Coumermycin 



Albicidin 
Metronidazole 



a subunit gyrase, gyrA and/or 
topoisomerase IV (probable target in Staph) 

Inhibits ATPase activity of B-subunit 
gyrase, gyrB 

Inhibits ATPase activity of B-subunit 
gyrase, gyrB 
DNA synthesis 

Causes single-strand breaks in DNA 



gyrAorB, icd, sloB 
gyrA 

norA (efflux in Staph) 
hipQ 

gyrB, cysB, cysE, nov, 

ompA 
gyrB, hisW 

tsx (nucleoside channel) 
nar 
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Inhibitors of Metabolic Pathways 



Sulfonamides, 1932 

Sulfanilamide 
Trimethoprim, 1962 
Showdomycin 



Thiolactomycin 



Psicofuranine 
Triclosan 

Diazoborines Isoniazid, 
Ethionamide 



blocks synthesis of dihydrofolate,dihydro- 

pteroate synthesis, folP 

Inhibits dihydrofolate reductase, folA 

Nucleoside analogue capable of alkylating 

sulfhydryl groups, inhibitor of thymidylate 

synthetase 

type II fatty acid synthase inhibitor 



Adenosine glycoside antibiotic, target is 
GMP synthetase 
Inhibits fatty acid synthesis 
heterocyclic, contains boron, inhibit fatty 
acid synthesis, enoyl-ACP reductase, J&A/ 



folP, gpt, pabA, pabB, 
pabC 
folA, thyA 
nupC.pnp 



emrB 

fadB, emrB due to gene 

dosage 
guaA,B 



fab! (envM) 
fabl (envM) 



Inhibitors of Translation 
Phenylpropanoids 

Chloramphenicol, 1947 



Tetracyclines, 1948, type II 

polyketides 

Minocycline 

Doxycycline 
Macrolides (type I polyketides) 

Erythromycin, 1950 

Carbomycin, Spiramycin 

etc 

Aminoglycosides Streptomycin, 
1944 

Neomycin 

Spectinomycin 
Kanamycin 

Kasugamycin 

Gentamicin, 1963 

Amikacin 

Paromycin 
Lincosamides 

Lincomycin, 1955 

Clindamycin 
Streptogramins Virginiamycin, 

1955 Pristinamycin 
Synercid: quinupristin 

/dalfopristin 
Fusidanes 

Fusidic Acid 
Kirromycin (Mocimycin) 

Pulvomycin 
Thiopeptin 



Binds to ribosomai peptidyl transfer center 
preventing peptide translocation/ binds to 
S6, L3, L6, L14, L16, L25, L26, L27, but 
preferentially to L 1 6 

Binding to 30S ribosomai subunit, "A" site 
on 30S subunit, blocks peptide elongation, 
strongest binding to S7 

Binding to 50 S ribosomai subunit, 23S 
rRNA, blocks peptide translocation, LI 5, 
L4,L12 



Irreversible binding to 30S ribosomai 
subunit, prevents translation or causes 
mistranslation of mRNA/I6S rRNA 



Binding to 50 S ribosomai subunit, blocks 
peptide translocation 

2 components, Streptogramins A&B, bind 
to the 50S ribosomai subunit blocking 
peptide translocation and peptide bond 
formation 

Inhibition of elongation fector G (EF-G) 
prevents peptide translocation 

Inhibition of elongation factor TU (EF-Tu), 

prevents peptide bond formation 

Binds to and inhibits EF-TU 

Sulfur-containing antibiotic, inhibits protein 
synthesis,EF-G 



rrn, cmlA, mar A, ompF, 
ompR 

clmA (cmr), mar, ompF 



rrn, rplC, rplD, rplV , 
mac 



rpsL, strC,M, ubiF 
atpA-E, ecfB, 

hemAC,D,E,G, topA, 
rpsC,D,E, rrn, spcB 
atpA-atpE, cpxA t ecfB, 

hemA,B,L, topA 
ksgA,B,QD, rp\B,K, 

rpsI.NMR 
rplF, ubiF 
cpxA 
rpsL 



UnB, rplN,0, rpsG 



fusA 
tufA,B 



rplE 
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Tiamulin 
Negamycin 

Oxazolidinones Linezolid 
Isoniazid 

Nitrofurantoin 



Pseudomonic Acids Mupirocin 
(Bactroban) 



Indolmycin 
Viomycin 



Thiopeptides 

Thiostrepton 

Micrococcin 



Inhibits protein synthesis 

Inhibits termination process of protein 

synthesis 
23S rRNA 



Inhibits protein synthesis, nitroreductases 
convert nitrofurantoin to highly reactive 
electrophilic intermediates which attack 
bacterial ribosomal proteins non- 
specifically 

Inhibition of isoleucyl tRN A synthetase- 
used for Staph, topical cream, nasal 
spray 

Inhibits tryptophanyl-tRNA synthetase 



Binds to LI 1-23S RNA complex 
Inhibits GTP hydrolysis by EF-G 
Stimulates GTP hydrolysis by EF-G 



rplC. rplD 
prfB 



pdx 
nfnA,B 



ileS 



trpS 

rrmA (23S rRNA 
methyltransferase; 
mutant has slow 
growth rate, slow 
chain elongation rate, 
and viomycin 
resistance) 



Inhibitors of Cell Walls/Membranes 



6-lactams 

Penicillin, 1929 Ampicillin 

Methicillin, 1960 



Cephalosporins, 1962 
Mecillinam (amdinocillin) 

Aztreonam (Furazlocillin) 
Bacilysin, Tetaine 
Glycopeptides Vancomycin, 1955 

Polypeptides Bacitracin 

Cyclic lipopeptide Daptomycin, 
1980 



Cyclic polypeptides Polymixin, 

1939 
Fosfomycin, L969 



Inhibition of one or more cell wall 
transpeptidases, endopeptidases, and 
glycosidases (PBPs), of the 12 PBPs only 2 
are essential: mrdA (PBP2) and ftsl (pbpB> 
PBP3) 



Cycloserine 



Binds to and inactivates PBP2 (mrdA) 
Inactivates PBP3 (ftsl) 

Dipeptide, inhib glucosamine synthase 
Inhib G+ cell wall syn, binds to terminal 
D-ala-D-ala of pentapeptide, 
Prevents dephosphorylation and 
regeneration of lipid carrier 
Disrupts multiple aspects of membrane 
function, including peptidoglycan 
synthesis, lipoteichoic acid synthesis, and 
the bacterial membrane potential 
Surfactant action disrupts cell membrane 
lipids, binds lipid A mioety of LPS 
Analogue of P-enolpyruvate, inhibits 1 st 
step in peptidoglycan synthesis - UDP-N- 
acetylglucosamine enolpyruvyl 
transferase, murA. Also acts as 
Immunosuppressant 
Prevents formation of D-ala dimer, 
inhibits D-ala ligase, dcRA.B 



ampQ ampD, ampE, 
envZ, galU, hipA, 
hipQ ompC, ompF t 
ompR, ptsl, rfa, 
tolD. tolE 

tonB 

alaS, argS, crp, cyaA, 
envB, mrdA,B, 
mreB,C,D 

dppA 



rfa 



pmrA 

murA, crp, cyaA glpT, 
hipA, ptsl uhpT 



hipA, cycA 
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Alafosfaiin phosphonodipeptide, eel] wall synthesis pepA. tpp 

inhibitor, potentiator of ^-lactams 

Inhibitors of Protein Processing/Transport 

Globomycin Inhibits signal peptidase 11 (cleaves Ipp, dnaE 

prolipoproteins subsequent to lipid 
modification, IspA 

EXAMPLE 12 

Transfer of Exogenous Nucleic Acid Sequences to other Bacterial Species Using the E coli Expression 
Vectors or Expression Vectors Functional in Bacterial Species other than E colt. 

5 Molecule No. EcXA059, encoding a portion of the ypjA gene of Escherichia coli, was 

either transformed directly into Enterobacter cloacae, Salmonella typhimurium and/or Klebsiella 
pneumoniae or subcloned into an expression vector functional in these species and the subclones 
transformed into these organisms. Suitable expression vectors are well known in the art. These 
expression vectors were introduced into Enterobacter cloacae, Salmonella typhimurium and/or 

10 Klebsiella pneumoniae cells that were then assayed for growth inhibition according to the method 
of Example 1. After growth in liquid culture, cells were plated at various serial dilutions and a 
score determined by calculating the log difference in growth for INDUCED vs. UNINDUCED 
antisense RNA expression as determined by the maximum 10 fold dilution at which a colony was 
observed. If there was no effect of antisense RNA expression in one organism, the clone is given a 

15 score of zero "0" in that organism. In contrast, a score of "8" means that 1 0 8 times more ceils were 
required to observe a colony formed on the induced state than in the non-induced state under the 
conditions used and in that organism. 

Expression vectors containing Molecule No. EcXA059 were found to inhibit bacterial 
growth in all four organisms when expression of the antisense RNA was induced with IPTG. A 

20 score of 8 was assigned for Escherichia coli, Enterobacter cloacae, and Salmonella typhimurium 
and in Klebsiella pneumoniae the score was >7. The protein encoded by this sequence is used as a 
target sequence to screen candidate compound libraries as described above. 

In addition, the above methods were validated using other antisense nucleic acids which 
inhibit the growth of E. coli which were identified using methods similar to those described above. 

25 Expression vectors which inhibited growth of E coli upon induction of antisense RNA expression 
with IPTG were transformed directly into Enterobacter cloacae, Klebsiella pneumonia or 
Salmonella typhimurium. The transformed cells were then assayed for growth inhibition according 
to the method of Example I. After growth in liquid culture, cells were plated at various serial 
dilutions and a score determined by calculating the log difference in growth for INDUCED vs. 

30 UNINDUCED antisense RNA expression as determined by the maximum 10 fold dilution at which 
a colony was observed. The results of these experiments are listed in Table V below. If there was 
no effect of antisense RNA expression in a microorganism, the clone is minus in Table V below. In 
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contrast, a positive in Table V below means that at least 1 0 fold more cells were required to observe 
a colony on the induced plate than on the non-induced plate under the conditions used and in that 
microorganism. 

Sixteen of the constructs were found to inhibit growth in all the microorganisms tested upon 
5 induction of antisense RNA expression with IPTG. 

TABLE V 

Sensitivity of Other Microorganisms to Antisense Nucleic Acids That Inhibit Proliferation in E. coli 



MoL No. 


S. typhimurium 


E. cloacae 


#L pneumoniae 


EcXAOOl 


+ 


+ 




EcXA004 


+ 




_ 


EcXAOOS 


+ 


+ 


+ 


EcXA006 






_ 


EcXA007 


m 


! + 


_ 


EcXA008 


+ 




+ 


EcXA009 








EcXAOlO 




+ 


+ 


EcXAOl 1 


_ 


+ 




EcXA012 


_ 


+ 


_ 


EcXA013 


+ 


+ 


+ 


EcXAOl 4 


+ 


+ 




EcXA015 


+ 


+ 


+ 


EcXA016 


+ 


+ 


+ 


EcXAOl 7 




+ 


+ 


EcXA018 


+ 


+ 


+ 


EcXAOl 9 


+ 


+ 


+ 


EcXA020 


+ 


+ 


+ 


| EcXA021 


+ 


+ 


+ 


EcXA023 


+ 


+ 


+ 


EcXA024 




- 


+ 


S EcXA025 




- 


- 


! EcXA026 


+ 


+ 




| EcXA027 




+ 




EcXA028 


+ 






EcXA029 








EcXA030 




+ 




EcXA031 


+ 






EcXA032 


+ 


+ 




EcXA033 


+ 


+ 


+ 


EcXA034 


+ 


+ 


+ 


EcXA035 








EcXA036 


+ 




+ 


EcXA037 


+ 


+ 




EcXA038 


+ 


+ 


+ 


! EcXA039 


+ 






EcXA041 


+ 


+ 


+ 


! EcXA042 




+ 


+ 
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MoL No. 


5L typhlmurium 


EL cloacae 


JL pneumoniae 


EcXA043 


- 


- 


1 


EcXA044 


- 


- 


- 


EcXA045 


+ 


+ 


+ | 


EcXA046 


• 


- 


- 


EcXA047 1 


+ 


+ 


- 


EcXA048 


- 


- 


- 


EcXA049 


+ 


- 


- 


EcXA050 




- 


- 


EcXAOSl 


+ 


- 


- 


EcXA052 


+ 


- 


- 


EcXA053 


+ 


+ 


+ 


j EcXA054 


- 


- 


+ 


EcXA055 


+ 


- 


- 


EcXA056 


+ 


- 


+ 


EcXA057 


+ 


+ 


- 


EcXA058 


- 


- 


- 


EcXA059 


+ 


+ 


+ 


EcXA060 




- 


- 


EcXA061 




- 


- 


EcXA062 


- 


.- 


- 


EcXA063 


+ 


+ 


- 


EcXA064 


- 


- 


- 


EcXA065 


+ 


+ 


■ - 


EcXA066 


- 


- 


- 


EcXA067 




+ 


- 


EcXA068 


- 


- 


- 


EcXA069 




+ 


- 


EcXA070 


- 


- 


- 


EcXA07 1 


+ 


- 


- 


EcXA072 


+ 


- 


+ 


EcXA073 




+ 


+ 


EcXA074 


+ 


+ 


+ 


EcXA075 


+ 


- 


- 


EcXA076 


- 


+ 


- 


EcXA077 


+ 


+ 


- 


EcXA079 


+ 


+ 


+ 


EcXA080 


+ 


- 


- 


EcXA082 


- 


+ 


- 


EcXA083 


- 


- 


- 


EcXA084 


- 


+ 


- 


EcXA086 


- 


- 


- 


EcXA087 




- 


- 


EcXA088 








EcXA089 








EcXA090 








EcXA091 








EcXA092 








EcXA093 
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MoL No. 


£ typhlmurium 


E. cloacae 


K. pneumoniae 


EcXA094 


+ 


+ 


+ 


EcXA095 


+ 






EcXA096 








EcXA097 


+ 






EcXA098 


+ 






EcXA099 








EcXAIOO 








EcXAlOl 








EcXA102 








EcXA103 




+ 




EcXA104 


+ 


+ 


+ 


EcXA106 


+ 


+ 




EcXA107 








EcXA108 








EcXA109 








EcXAllO 


+ 


+ 




EcXAlll 








EcXA112 




+ 




EcXA113 


+ 


+ 




EcXA114 




+ 




EcXAllS 








EcXA116 


+ 


+ 




EcXA117 


+ 






EcXA118 








EcXA119 


+ 


+ 




EcXA120 








EcXA121 








EcXA122 


+ 




+ 


EcXA123 


+ 






EcXA124 








EcXA125 








EcXA126 








EcXA127 


+ 


+ 




EcXA128 








EcXA129 




+ 




EcXA130 


+ 


+ 




EcXA132 








EcXA133 








EcXA 1 36 




_ 


_ 


EcXA137 


_ 


_ 




EcXA 138 








EcXA 139 


_ 


_ 


_ 


EcXA 140 


+ 






EcXA141 


+ 






EcXA 142 








EcXA 143 








EcXA 144 i 








EcXA 145 
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MoL No. 


& typhimurium 


EL cloacae 


K. pneumoniae 


EcXA146 


- 


- 


- 


EcXA147 


- 


- 


- 


EcXA148 


- 


- 


- 


EcXA149 


+ 


+ 


+ 


EcXAlSO 


- 


- 


- 


EcXAlSl 


+ 


- 


- 


EcXA152 




- 


- 


EcXA153 


+ 


+ 


- 


EcXA154 


_ 


- 




EcXA155 


- 


-• 


ND 


EcXA156 


_ 


+ 


- 


EcXA157 


_ 


- 


- 


EcXA!58 




_ 


- 


EcXA159 




- 


- 


EcXAI60 


+ 


_ 


- 


EcXAl62 






_ 


EcXAl63 






- 


EcXAl64 


_ 




- 


EcXAl65 


_ 




- 


EcXA166 


• 




- 


EcXA167 


• 


- 


- 


EcXA168 






- 


EcXAl69 




+ 


• 


EcXA171 






- 


EcXAl72 


- 


- 


- 


EcXA!73 




• 


- 


EcXA174 


• 






EcXA175 






- 


EcXA176 






- 


EcXAl78 




_ 


- 


EcXAl79 


_ 




- 


EcXAI80 


+ 


- 


- 


EcXA181 




- 


• 


EcXAI82 


- 


• 


- 


EcX A 1 83 


- 




- 


EcXA184 


- 


- 


- 


EcXAI85 


- 


- 


- 


EcXAl86 








EcXAl87 


+ 


+ 


+ 


EcXAl89 


+ 






EcXAI90 


+ 


+ 


+ 


EcXA191 


+ 


+ 




EcXAt92 




+ 





Thus, the ability of an antisense nucleic acid which inhibits the proliferation of £ coli to 
inhibit the growth of other organims may be evaluated by either transforming the antisense nucleic 
acid directly into other Escherichia species or inserting the antisense nucleic acid into expression 
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vectors that are functional in other Gram negative species such as Enterobacter cloacae, Salmonella 
typhimurium, and/or Klebsiella pneumoniae . Similarly, the antisense nucleic acid can be inserted 
in expression vectors that are functional in Gram-positive species such as Staphylococcus aureus, 
Enterococcus faecalis and Streptococcus pneumoniae or other species. 
5 Those skilled in the art will appreciate that a negative result in a heterologous microorganism 

does not mean that that microorganism is missing that gene nor does it mean that the gene is 
unessential. However, a positive result means that the heterologous microorganism contains a 
homologous gene which is required for proliferation of that microorganism. The homologous gene 
may be obtained using the methods described herein. Those cells that are inhibited by antisense may 
10 be used in cell based assays as described herein for the identification and characterization of 
compounds in order to develop antibiotics effective in these microorganisms. Those skilled in the art 
will appreciate that an antisense molecule which works in the microorganism from which it was 
obtained will not always work in a heterologous microorganism. 

EXAMPLE 13 

15 Use of Identified Exogenous Nucleic Acid Sequences as Probes 

The identified sequence of the present invention can be used as probes to obtain the 
sequence of additional genes of interest from a second organism. For example, probes to potential 
bacterial target proteins may be hybridized to nucleic acids from other organisms including other 
bacteria and higher organisms, to identify homologous sequences. Such hybridization might 
20 indicate that the protein encoded by the gene to which the probe corresponds is found in humans 
and therefore not necessarily a good drug target. Alternatively, the gene can be conserved only in 
bacteria and therefore would be a good drug target for a broad spectrum antibiotic or antimicrobial. 
Probes derived from the identified nucleic acid sequences of interest or portions thereof can be 
. labeled with detectable labels familiar to those skilled in the art, including radioisotopes and non- 
25 radioactive labels, to provide a detectable probe. The detectable probe can be single stranded or double 
stranded and can be made using techniques known in the art, including in vitro transcription, nick 
translation, or kinase reactions. A nucleic acid sample containing a sequence capable of hybridizing to 
the labeled probe is contacted with the labeled probe. If the nucleic acid in the sample is double 
stranded, it can be denatured prior to contacting the probe. In some applications, the nucleic acid 
30 sample can be immobilized on a surface such as a nitrocellulose or nylon membrane. The nucleic acid 
sample can comprise nucleic acids obtained from a variety of sources, including genomic DNA, cDNA 
libraries, RNA, or tissue samples. 

Procedures used to detect the presence of nucleic acids capable of hybridizing to the detectable 
probe include well known techniques such as Southern blotting, Northern blotting, dot blotting, colony 
35 hybridization, and plaque hybridization. In some applications, the nucleic acid capable of hybridizing 
to the labeled probe can be cloned into vectors such as expression vectors, sequencing vectors, or in 
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vitro transcription vectors to facilitate the characterization and expression of the hybridizing nucleic 
acids in die sample. For example, such techniques can be used to isolate, purify and clone sequences 
from a genomic library, made from a variety of bacterial species, which are capable of hybridizing to 
probes made from the sequences identified in Examples 5 and 6. 
5 EXAMPLE 14 

Preparation of PCR Primers and Amplification of DNA 
The identified E. coli genes corresponding directly to or located within the operon of nucleic 
acid sequences required for proliferation or portions thereof can be used to prepare PCR primers for a 
variety of applications, including the identification or isolation of homologous sequences from other 

10 species, for example S. typhimurium, K cloacae, and Klebsiella pneumoniae, which contain part or all 
of die homologous genes. Because homologous genes are related but not identical in sequence, those 
skilled in the art will often employ degenerate sequence PCR primers. Such degenerate sequence 
primers are designed based on conserved sequence regions, either known or suspected, such as 
conserved coding regions. The successful production of a PCR product using degenerate probes 

IS generated from the sequences identified herein would indicate the presence of a homologous gene 
sequence in the species being screened. The PCR primers are at least 10 bases, and preferably at least 
20 bases in length. More preferably, the PCR primers are at least 20-30 bases in length. In some 
embodiments, the PCR primers can be more than 30 bases in length. It is preferred that the primer 
pairs have approximately the same G/C ratio, so that melting temperatures are approximately the same. 

20 A variety of PCR techniques are familiar to those skilled in the art. For a review of PCR technology, 
see Molecular Cloning to Genetic Engineering White, B.A. Ed. in Methods in Molecular Biology 67: 
Humana Press, Totowa 1997. When the entire coding sequence of the target gene is known, the 5' and 
3' regions of the target gene can be used as the sequence source for PCR probe generation. In each of 
these PCR procedures, PCR primers on either side of the nucleic acid sequences to be amplified are 

25 added to a suitably prepared nucleic acid sample along with dNTPs and a thermostable polymerase 
such as Taq polymerase, Pfu polymerase, or Vent polymerase. The nucleic acid in the sample is 
denatured and the PCR primers are specifically hybridized to complementary nucleic acid sequences in 
the sample. The hybridized primers are extended. Thereafter, another cycle of denaturation, 
hybridization, and extension is initiated. The cycles are repeated multiple times to produce an 

30 amplified fragment containing the nucleic acid sequence between the primer sites. 

EXAMPLE 15 
Inverse PCR 

The technique of inverse polymerase chain reaction can be used to extend the known nucleic 
acid sequence identified in Examples 5 and 6. The inverse PCR reaction is described generally by 
35 Ochman et al., in Ch. 10 of PCR Technology: Principles and Applications for DNA Amplification, 
(Henry A. Erlich, Ed.) W.H. Freeman and Co. (1992). Traditional PCR requires two primers that are 
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used to prime the synthesis of complementary strands of DNA. In inverse PCR, only a core sequence 
need be known. 

Using the sequences identified as relevant from the techniques taught in Examples 5 and 6 and 
applied to other species of bacteria, a subset of exogenous nucleic sequences are identified that 
5 correspond to genes or operons that are required for bacterial proliferation. In species for which a 
genome sequence is not known, the technique of inverse PCR provides a method for obtaining the gene 
in order to determine the sequence or to place the probe sequences in full context to the target sequence 
to which the identified exogenous nucleic acid sequence binds. 

To practice this technique, the genome of the target organism is digested with an appropriate 
1 0 restriction enzyme so as to create fragments of nucleic acid that contain the identified sequence as well 
as unknown sequences that flank the identified sequence. These fragments are then circularized and 
become the template for the PCR reaction. PCR primers are designed in accordance with the teachings 
of Example 15 and directed to the ends of the identified sequence are synthesized. The primers direct 
nucleic acid synthesis away from the known sequence and toward the unknown sequence contained 
15 within the circularized template. After the PCR reaction is complete, the resulting PCR products can 
be sequenced so as to extend the sequence of the identified gene past the core sequence of the identified 
exogenous nucleic acid sequence identified. In this manner, the full sequence of each novel gene can 
be identified. Additionally the sequences of adjacent coding and noncoding regions can be identified. 

EXAMPLE 16 

20 Identification of Genes Required for Staphylococcus aureus Proliferation 

Genes required for proliferation in Staphylococcus aureus are identified according to the 
methods described above. 

EXAMPLE 17 

Identification of Genes Required for Neisseria gonorrhoeae Proliferation 
25 Genes required for proliferation in Neisseria gonorrhoeae are identified according to the 

methods described above. 

EXAMPLE 18 

Identification of Genes Required for Pseudomonas aeruginosa Proliferation 
Genes required for proliferation in Pseudomonas aeruginosa are identified according to the 
30 methods described above. 

EXAMPLE 19 

Identification of Genes Required for Enterococcus faecalis Proliferation 
Genes required for proliferation in Enterococcus faecalis are identified according to the. 
methods described above. 
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EXAMPLE 20 

Identification of Genes Required for Haemophilus influenzae Proliferation 
Genes required for proliferation in Haemophilus influenzae are identified according to the 
methods described above. 
5 EXAMPLE 21 

Identification of Genes Required for Salmonella twhimurium Proliferation 
Genes required for proliferation in Salmonella typhimurium are identified according to the 
methods described above. 

EXAMPLE 22 

10 Identification of Genes Required for Helicobacter pylori Proliferation 

Genes required for proliferation in Helicobacter pylori are identified according to the 
methods described above. 

EXAMPLE 23 

Identification of Genes Required for Mycoplasma pneumoniae Proliferation 
IS Genes required for proliferation in Mycoplasma pneumoniae are identified according to the 

methods described above. 

EXAMPLE 24 

Identification of Genes Required for Plasmodium ovale Proliferation 
Genes required for proliferation in Plasmodium ovale are identified according to the methods 
20 described above. 

EXAMPLE 25 

Identification of Genes Required for Saccharomvces cerevisiae Proliferation 
Genes required for proliferation in Saccharomyces cerevisiae are identified according to the 
methods described above. 
25 EXAMPLE 26 

Identification of Genes Required for Entamoeba histolytica Proliferation 
Genes required for proliferation in Entamoeba histolytica are identified according to the 
methods described above. 

EXAMPLE 27 

30 Identification of Genes Required for Candida albicans Proliferation 

Genes required for proliferation in Candida albicans are identified according to the methods 
described above. 



WO 01/34810 



PCT/US00/30950 



EXAMPLE 28 

Identification of Genes Required for Klebsiella pneumoniae Proliferation 
Genes required for proliferation in Klebsiella pneumoniae are identified according to the 
methods described above. 
5 EXAMPLE 29 

Identification of Genes Required for Salmonella typhi Proliferation 
Genes required for proliferation in Salmonella typhi are identified according to the methods 
described above. 

EXAMPLE 30 

10 Identification of Genes Required for Salmonella paratyphi Proliferation 

Genes required for proliferation in Salmonella paratyphi are identified according to the 
methods described above. 

EXAMPLE 31 

Identification of Genes Required for Salmonella cholerasuis Proliferation 
15 Genes required for proliferation in Salmonella cholerasuis are identified according to the 

methods described above. 

EXAMPLE 32 

Identification of Genes Required for Staphylococcus epidermis Proliferation 
Genes required for proliferation in Staphylococcus epidermis are identified according to the 
20 methods described above. 

EXAMPLE 33 

Identification of Genes Required for Mycobacterium tuberculosis Proliferation 
Genes required for proliferation in Mycobacterium tuberculosis are identified according to the 
methods described above. 
25 EXAMPLE 34 

Identification of Genes Required for Mycobacterium leprae Proliferation 
Genes required for proliferation in Mycobacterium leprae are identified according to the 
methods described above. 

EXAMPLE 35 

30 Identification of Genes Required for Treponema pallidum Proliferation 

Genes required for proliferation in Treponema pallidum are identified according to the 
methods described above. 

EXAMPLE 36 

Identification of Genes Required for Bacillus anthracis Proliferation 
35 Genes required for proliferation in Bacillus anthracis are identified according to the methods 

described above. 
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EXAMPLE 37 

Identification of Genes Required for Yersinia pestis Proliferation 
Genes required for proliferation in Yersinia pestis are identified according to the methods 
described above. 
5 EXAMPLE 38 

Identification of Genes Required for Clostridium botulinum Proliferation 
Genes required for proliferation in Clostridium botulinum are identified according to the 
methods described above. 

EXAMPLE 39 

10 Identification of Genes Required for Campylobacter jejuni Proliferation 

Genes required for proliferation in Campylobacter jejuni are identified according to the 
methods described above. 

EXAMPLE 40 

Identification of Genes Required for Chlamydia trachomatis Proliferation 
15 Genes required for proliferation in Chlamydia trachomatis are identified according to the 

methods described above. 

Use of Isolated Exogenous Nucleic Acid Fragments as Antisense Antibiotics 

In addition to using the identified sequences to enable screening of molecule libraries to 
20 identify compounds useful to identify antibiotics, the sequences themselves can be used as therapeutic 
agents. Specifically, the identified exogenous sequences in an antisense orientation can be provided to 
an individual to inhibit the translation of a bacterial target gene. 

Generation of Antisense Therapeutics from Identified Exogenous Sequences 
The sequences of the present invention can be used as antisense therapeutics for the 
25 treatment of bacterial infections or simply for inhibition of bacterial growth in vitro or in vivo. The 
therapy exploits the biological process in cells where genes are transcribed into messenger RNA 
(mRNA) that is then translated into proteins. Antisense RNA technology contemplates the use of 
antisense oligonucleotides directed against a target gene that will bind to its target and decrease or 
inhibit the translation of the target mRNA. In one embodiment, antisense oligonucleotides can be 
30 used to treat and control a bacterial infection of a cell culture containing a population of desired 
cells contaminated with bacteria. In another embodiment, the antisense oligonucleotides can be 
used to treat an organism with a bacterial infection. 

Antisense oligonucleotides can be synthesized from any of the sequences of the present 
invention using methods well known in the art. In a preferred embodiment, antisense 
35 oligonucleotides are synthesized using artificial means. Uhlmann & Peymann, Chemical Rev. 
90:543-584 (1990) review antisense oligonucleotide technology in detail. Modified or unmodified 
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antisense oligonucleotides can be used as therapeutic agents. Modified antisense oligonucleotides 
are preferred since it is well known that antisense oligonucleotides are extremely unstable. 
Modification of the phosphate backbones of the antisense oligonucleotides can be achieved by 
substituting the intemucleotide phosphate residues with methylphosphonates, phosphorothioates, 

5 phosphoramidates, and phosphate esters. Nonphosphate intemucleotide analogs such as siloxane 
bridges, carbonate brides, thioester bridges, as well as many others known in the art. The 
preparation of certain antisense oligonucleotides with modified intemucleotide linkages is described 
in U.S. Patent No. 5,142,047. 

Modifications to the nucleoside units of the antisense oligonucleotides are also 

0 contemplated. These modifications can increase the half-life and increase cellular rates of uptake 
for the oligonucleotides in vivo. For example, ct-anomeric nucleotide units and modified bases such 
as 1,2-dideoxy-d-ribofuranose, 1,2-dideoxy-l-phenylribofuranose, and JV\ A^-ethano-S-methyl- 
cytosine are contemplated for use in the present invention. 

An additional form of modified antisense molecules is found in peptide nucleic acids. 

5 Peptide nucleic acids (PNA) have been developed to hybridize to single and double stranded nucleic 
acids. PNA are nucleic acid analogs in which the entire deoxyribose-phosphate backbone has been 
exchanged with a chemically completely different, but structurally homologous, polyamide (peptide) 
backbone containing 2-aminoethyl glycine units. Unlike DNA, which is highly negatively charged, the 
PNA backbone is neutral. Therefore, there is much less repulsive energy between complementary 

0 strands in a PNA-DNA hybrid than in the comparable DNA-DNA hybrid, and consequently they are 
much more stable. PNA can hybridize to DNA in either a Watson/Crick or Hoogsteen fashion 
(Demidov et al., Proc. Natl. Acad. Sci. U.SA. 92:2637-2641, 1995; Egholm, Nature 365:566-568, 
1993; Nielsen et al., Science 254:1497-1500, 1991; Dueholm et al., New J. Chem. 21:19-31, 1997). 

Molecules called PNA "clamps" have been synthesized which have two identical PNA 

5 sequences joined by a flexible hairpin linker containing three 8-amino3,6-dioxaoctanoic acid units. 
When a PNA clamp is mixed with a complementary homopurine or homopyrimidine DNA target 
sequence, a PNA-DNA-PNA triplex hybrid can form which has been shown to be extremely stable 
(Bentin et al., Biochemistry 35:8863-8869, 1996; Egholm et al., Nucleic Acids Res. 23:217-222, 1995; 
Griffith et al.,/. Am. Chem. Soc. 117:831-832, 1995). 

9 The sequence-specific and high affinity duplex and triplex binding of PNA have been 

extensively described (Nielsen et al., Science 254:1497-1500, 1991; Egholm et al., J. Am Chem. Soc. 
114:9677-9678, 1992; Egholm et aL, Nature 365:566-568, 1993; Almarsson et al., Proc. Natl Acad 
ScL USA. 90:9542-9546, 1993; Demidov et aL, Proc. Natl Acad Sci. U.SA. 92:2637-2641, 1995). 
They have also been shown to be resistant to nuclease and protease digestion (Demidov et al., 

5 Biochem. Pharm. 48:1010-1313, 1994). PNA has been used to inhibit gene expression (Hanvey et al., 
Science 258:1481-1485,1992; Nielsen et aL, Nucl. Acids. Res., 21:197-200, 1993; Nielsen et al., Gene 
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149:139-145, 1994; Good & Nielsen, Science, 95: 2073-2076, 1998), to block restriction enzyme 
activity (Nielsen et al, supra, 1993), to act as an artificial transcription promote (Mollegaaid, Proc. 
NatL Acad Set U.S.A. 91:3892-3895, 1994) and as a pseudo restriction endonuclease (Demidov et al., 
Nucl. Acids. Res. 21:2103-2107, 1993). Recently, PNA has also been shown to have antiviral and 
5 antitumoral activity mediated through an antisense mechanism (Norton, Nature Biotechnol, 14:615- 
619, 1996; Hirschman et al., J. Investig. Med 44:347-351, 1996). PNAs have been linked to various 
peptides in order to promote PNA entry into cells (Basu et aL, Bioconj. Chem. 8:481-488, 1997; 
Pardridge et al., Proc. Nad. Acad Sci USA. 92:5592-5596, 1995). 

The antisense oligonucleotides contemplated by the present invention can be administered 

10 by direct application of oligonucleotides to a target using standard techniques well known in the art. 
The antisense oligonucleotides can be generated within the target using a plasmid, or a phage. 
Alternatively, the antisense nucleic acid may be expressed from a sequence in the chromosome of 
the target cell. It is further contemplated that contemplated that the antisense oligonucleotide 
contemplated are incorporated in a ribozyme sequence to enable the antisense to specifically bind and 

15 cleave its target mRNA. For technical applications of ribozyme and antisense oligonucleotides see 
Rossi et al., Pharmacol. Ther. 50(2):245-254, (1991). The present invention also contemplates 
using a retron to introduce an antisense oligonucleotide to a cell. Retron technology is exemplified 
by U.S. Patent No. 5,405,775. Antisense oligonucleotides can also be delivered using liposomes or 
by electroporation techniques which are well known in the art. 

20 The antisense nucleic acids of the present invention can also be used to design antibiotic 

compounds comprising nucleic acids which function by intracellular triple helix formation. Triple 
helix oligonucleotides are used to inhibit transcription from a genome. The sequences identified as 
required for proliferation in the present invention, or portions thereof, can be used as templates to 
inhibit microorganism gene expression in individuals infected with such organisms. Traditionally, 

25 homopurine sequences were considered the most useful for triple helix strategies. However, 
homopyrimidine sequences can also inhibit gene expression. Such homopyrimidine oligonucleotides 
bind to the major groove at homopurine:homopyrimidine sequences. Thus, both types of sequences 
based on the sequences of the present invention that are required for proliferation are contemplated for 
use as antibiotic compound templates. 

30 The antisense oligonucleotides of this example employ the identified sequences of the 

present invention to induce bacterial cell death or at least bacterial stasis by inhibiting target gene 
translation. Antisense oligonucleotides containing from about 8 to 40 bases of the sequences of the 
present invention have sufficient complementary to form a duplex with the target sequence under 
physiological conditions. 

35 To kill bacterial cells or inhibit their growth, the antisense oligonucleotides are applied to 

the bacteria or to the target cells under conditions that facilitate their uptake. These conditions 
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include sufficient incubation times of cells and oligonucleotides so that the antisense 
oligonucleotides are taken up by the cells. In one embodiment, an incubation period of 7-10 days is 
sufficient to kill bacteria in a sample. An optimum concentration of antisense oligonucleotides is 
selected for use. 

5 The concentration of antisense oligonucleotides to be used can vary depending on the type 

of bacteria sought to be controlled, the nature of the antisense oligonucleotide to be used, and the 
relative toxicity of the antisense oligonucleotide to the desired cells in the treated culture. 
Antisense oligonucleotides can be introduced to cell samples at a number of different concentrations 
preferably between lxlO' 10 M to lxl O^M. Once the minimum concentration that can adequately 

10 control gene expression is identified, the optimized dose is translated into a dosage suitable for use in 
vivo. For example, an inhibiting concentration in culture of lxl 0~ 7 translates into a dose of 
approximately 0.6 mg/kg body weight Levels of oligonucleotide approaching 100 mg/kg body weight 
or higher may be possible after testing the toxicity of the oligonucleotide in laboratory animals. It is 
additionally contemplated that cells from the subject are removed, treated with the antisense 

15 oligonucleotide, and reintroduced into the subject. This range is merely illustrative and one of skill in 
the art are able to determine the optimal concentration to be used in a given case. 

After the bacterial cells have been killed or controlled in a desired culture, the desired cell 
population may be used for other purposes. 

EXAMPLE 41 

20 The following example demonstrates the ability of an E. coli antisense oligonucleotide to 

act as a bactericidal or bacteriostatic agent to treat a contaminated cell culture system. The 
application of the antisense oligonucleotides of the present invention are thought to inhibit the 
translation of bacterial gene products required for proliferation. 

The antisense oligonucleotide of this example corresponds to a 30 base phophorothioate 

25 modified oligodeoxynucelotide complementary to a nucleic acid involved in proliferation, such as 
Molecule Number EcXA056 (SEQ ID NO: 1). A sense oligodeoxynucelotide complementary to 
the antisense sequence is synthesized and used as a control. The oligonucleotides are synthesized 
and purified according to the procedures of Matsukura, et al., Gene 72:343 (1988). The test 
oligonucleotides are dissolved in a small volume of autoclaved water and added to culture medium 

30 to make a 1 00 micromolar stock solution. 

Human bone marrow cells are obtained from the peripheral blood of two patients and 
cultured according standard procedures well known in the art. The culture is contaminated with the 
K- 12 strain of E. coli and incubated at 37°C overnight to establish bacterial infection. 

The control and antisense oligonucleotide containing solutions are added to the 

35 contaminated cultures and monitored for bacterial growth. After a 10 hour incubation of culture 
and oligonucleotides, samples from the control and experimental cultures are drawn and analyzed 
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for the translation of the target bacterial gene using standard microbiological techniques well 
known in the art The target E. coli gene is found to be translated in the control culture treated with 
the control oligonucleotide, however, translation of the target gene in the experimental culture 
treated with the antisense oligonucleotide of the present invention is not detected or reduced. 

5 EXAMPLE 42 

A subject suffering from an E. coli infection is treated with the antisense oligonucleotide 
preparation of Example 39. The antisense oligonucleotide is provided in a pharmaceutically 
acceptable carrier at a concentration effective to inhibit the translation of the target gene. The 
present subject is treated with a concentration of antisense oligonucleotide sufficient to achieve a 

10 blood concentration of about 100 micromolar. The patient receives daily injections of antisense 
oligonucleotide to maintain this concentration for a period of 1 week. At the end of the week a 
blood sample is drawn and analyzed for the presence or absence using standard techniques well 
known in the art There is no detectable evidence of £. coli and the treatment is terminated. 

EXAMPLE 43 

15 Preparation and use of Triple Helix Probes 

The sequences of microorganism genes required for proliferation of the present invention are 
scanned to identify 10-mer to 20-mer homopyrimidine or homopurine stretches that could be used in 
triple-helix based strategies for inhibiting gene expression. Following identification of candidate 
homopyrimidine or homopurine stretches, their efficiency in inhibiting gene expression is assessed by 

20 introducing varying amounts of oligonucleotides containing the candidate sequences into a population 
of bacterial cells that normally express the target gene. The oligonucleotides may be prepared on an 
oligonucleotide synthesizer or they may be purchased commercially from a company specializing in 
custom oligonucleotide synthesis, such as GENSET, Paris, France. 

The oligonucleotides can be introduced into the cells using a variety of methods known to 

25 those skilled in the art, including but not limited to calcium phosphate precipitation, DEAE-Dextran, 
electroporation, liposome-mediated transfection or native uptake. 

Treated cells are monitored for a reduction in proliferation using techniques such as 
monitoring growth levels as compared to untreated cells using optical density measurements. The 
oligonucleotides that are effective in inhibiting gene expression in cultured cells can then be introduced 

30 in vivo using the techniques well known in that art at a dosage level shown to be effective. 

In some embodiments, the natural (beta) anomers of the oligonucleotide units can be replaced 
with alpha anomers to render the oligonucleotide more resistant to nucleases. Further, an intercalating 
agent such as ethidium bromide, or the like, can be attached to the 3' end of the alpha oligonucleotide 
to stabilize the triple helix. For information on the generation of oligonucleotides suitable for triple 

35 helix formation see Griffin etal. (Science 245:967-971 (1989)). 
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EXAMPLE 44 

Identification of Bacterial Strains from Isolated Specimens by PCR 
Classical bacteriological methods for the detection of various bacterial species are time 
consuming and costly. These methods include growing the bacteria isolated from a subject in 
5 specialized media, cultivation on selective agar media, followed by a set of confirmation assays that 
can take from 8 to 10 days or longer to complete. Use of the identified sequences of the present 
invention provides a method to dramatically reduce the time necessary to detect and identify 
specific bacterial species present in a sample. 

In one exemplary method, bacteria are grown in enriched media and DNA samples are isolated 
10 from specimens of, for example, blood, urine, stool, saliva or central nervous system fluid by 
conventional methods. A panel of PCR primers based on identified sequences unique to various 
species of microorganisms are then utilized in accordance with Example 12 to amplify DNA of 
approximately 100-200 bases in length from the specimen. A separate PCR reaction is set up for each 
pair of PCR primers and after the PCR reaction is complete, the reaction mixtures are assayed for the 
15 presence of PCR product. The presence or absence of bacteria from the species to which the PCR 
primer pairs belong is determined by the presence or absence of a PCR product in the various test PCR 
reaction tubes. 

Although the PCR reaction is used to assay the isolated sample for the presence of various 
bacterial species, other assays such as the Southern blot hybridization are also contemplated. 

20 
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WHAT IS CLAIMED IS : 

1 . A purified or isolated nucleic acid sequence consisting essentially of one of SEQ 
ID NOs: 1-127, wherein expression of said nucleic acid inhibits proliferation of a microorganism. 

2. The nucleic acid sequence of Claim 1, wherein said nucleic acid sequence is 
5 complementary to at least a portion of a coding sequence of a gene whose expression is required for 

proliferation of a microorganism. 

3. The nucleic acid of Claim 1, wherein said nucleic acid sequence is complementary 
to at least a portion of an RNA required for proliferation of a microorganism. 

4. The nucleic acid of Claim 3, wherein said RNA is an RNA encoding more than one 
10 gene product. 

5. A nucleic acid comprising a fragment of one of SEQ ID NOs.: 1-127, said fragment 
selected from the group consisting of fragments comprising at least 10, at least 20, at least 25, at 
least 30, at least 50 and more than 50 consecutive bases of one of SEQ ID NOs: 1-127. 

6. A vector comprising a promoter operably linked to the nucleic acid sequence of 
15 Claims 1,2,3,4, or 5. 

7. The vector of Claim 6, wherein said promoter is active in a microorganism selected 
from the group consisting of Aspergillus fumigatus, Bacillus anthracis, Campylobacter jejuni, 
Candida albicans, Chlamydia pneumoniae, Chlamydia trachomatus, Clostridium botulinum, 
Cryptococcus neoformans, Enterobacter cloacae, Enterococcus Jaecalis, Escherichia coli, 

20 Haemophilus influenzae, Helicobacter pylori, Klebsiella pneumoniae, Mycobacterium leprae, 
Mycobacterium tuberculosis, Neisseria gonorrhoeae, Pseudomonas aeruginosa, Saccharomyces 
cerevisae, Salmonella cholerasuis, Salmonella paratyphi, Salmonella typhi, Salmonella 
typhimurium, Staphylococcus aureus, Staphylococcus epidermidis, Streptococcus pneumoniae, 
Treponema pallidum, and Yersinia pestis or any species falling within the genera of any of the 

25 above species. 

8. A host cell containing the vector of Claim 6 or Claim 7. 

9. A purified or isolated nucleic acid consisting essentially of the coding sequence of 
one of SEQ ID NOs: 128-298. 

10. A fragment of the nucleic acid of Claim 8, said fragment comprising at least 10, at 
30 least 20, at least 25, at least 30, at least 50 or more than 50 consecutive bases of one of SEQ ID 

NOs: 128-298. 

11. A vector comprising a promoter operably linked to the nucleic acid of Claim 9 or 
Claim 10. 

12. A purified or isolated antisense nucleic acid comprising a nucleic acid sequence 
35 complementary to at least a portion of an intragenic sequence, intergenic sequence, sequences 

spanning at least a portion of two or more genes, 5 T noncoding region, or 3' noncoding region 
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within an operon comprising a proliferation-required gene whose activity or expression is inhibited 
by an antisense nucleic acid comprising one of SEQ ID NOs.: 1-127. 

13. A purified or isolated nucleic acid comprising a nucleic acid having at least 70% 
identity to a sequence selected from the group consisting of SEQ ID NOs.: 1-127, fragments 

5 comprising at least 25 consecutive nucleotides of SEQ ID NOs.: 1-127, the sequences 
complementary to SEQ ID NOs.: 1-127 and the sequences complementary to fragments comprising 
at least 25 consecutive nucleotides of SEQ ID NOs.: 1-127 as determined using BLASTN version 
2.0 with the default parameters. 

14. The nucleic acid of Claim 13, wherein said nucleic acid is from an organism 
10 selected from the group consisting of Aspergillus fumigatus. Bacillus anthracis, Campylobacter 

jejuni, Candida albicans, Chlamydia pneumoniae, Chlamydia trachomatus, Clostridium botulinum, 
Cryptococcus neoformans, Enterobacter cloacae, Enterococcus faecalis, Staphylococcus aureus, 
Haemophilus influenzae, Helicobacter pylori, Klebsiella pneumoniae, Mycobacterium leprae, 
Mycobacterium tuberculosis, Neisseria gonorrhoeae, Pseudomonas aeruginosa, Saccharomyces 
15 cerevisae, Salmonella cholerasuis, Salmonella paratyphi, Salmonella typhi, Salmonella 
typhimurium, Staphylococcus aureus, Staphylococcus epidermidis, Streptococcus pneumoniae, 
Treponema pallidum, and Yersinia pestis or any species falling within the genera of any of the 
above species. 

15. A vector comprising a promoter operably linked to a nucleic acid encoding a 
20 polypeptide whose expression is inhibited by an antisense nucleic acid comprising one of SEQ ID 

NOs.: 1-127. 

1 6. A host cell containing the vector of Claim 16. 

17. The vector of Claim 15, wherein said polypeptide comprises a polypeptide 
comprising a sequence selected from the group consisting of SEQ ID NOs: 299-469. 

25 18. A purified or isolated polypeptide comprising a polypeptide whose expression is 

inhibited by an antisense nucleic acid comprising one of SEQ ID NOs.: 1-127, or a fragment 
selected from the group consisting of fragments comprising at least 5, at least 10, at least 20, at least 
30, at least 40, at least 50, at least 60 or more than 60 consecutive amino acids of one of the said 
polypeptides. 

30 19. The polypeptide of Claim 18, wherein said polypeptide comprises a polypeptide 

comprising one of SEQ ID NOs.: 299-469 or a fragment comprising at least 5, at least 10, at least 
20, at least 30, at least 40, at least 50, at least 60 or more than 60 consecutive amino acids of a 
polypeptide comprising a sequence selected from the group consisting of SEQ ID NOs.: 299-469. 

20. A purified or isolated polypeptide comprising a polypeptide having at least 25% 

35 identity to a polypeptide whose expression is inhibited by a sequence selected from the group 
consisting of SEQ ID NOs.: 1-127, or at least 25% identity to a fragment comprising at least 5, at 
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least 10, at least 20, at least 30, at least 40, at least 50, at least 60 or more than 60 consecutive 
amino acids of a polypeptide whose expression is inhibited by a nucleic acid selected from the 
group consisting of SEQ ID NOs.: 1-127 as determined using FASTA version 3.0t78 with the 
default parameters. 

5 21 . The polypeptide of Claim 20, wherein said polypeptide has at least 25% identity to 

a polypeptide comprising one of SEQ ID NOs: 299-469 or at least 25% identity to a fragment 
comprising at least 5, at least 10, at least 20, at least 30, at least 40, at least 50, at least 60 or more 
than 60 consecutive amino acids of a polypeptide comprising one of SEQ ID NOs.: 299-469 as 
determined using FASTA version 3.0t78 with the default parameters. 
10 22. An antibody capable of specifically binding the polypeptide of one of Claims 18- 

21. 

. 23. A method of producing a polypeptide, comprising introducing a vector comprising 
a promoter operably linked to a nucleic acid encoding a polypeptide whose expression is inhibited 
by an antisense nucleic acid comprising one of SEQ ID NOs.: 1-127 into a cell. 
15 24. The method of Claim 23, further comprising the step of isolating said polypeptide. 

25. The method of Claim 23, wherein said polypeptide comprises a sequence selected 
from the group consisting of SEQ ID NOs.: 299-469. 

26. A method of inhibiting proliferation of a microorganism comprising inhibiting the 
activity or reducing the amount of a gene product whose expression is inhibited by an antisense 

20 nucleic acid comprising a sequence selected from the group consisting of SEQ ID NOs.: 1-127 or 
inhibiting the activity or reducing the amount of a nucleic acid encoding said gene product. 

27. The method of Claim 26, wherein said gene product comprises a polypeptide 
comprising a sequence selected from the group consisting of SEQ ID NOs.: 299-469. 

28. A method for identifying a compound which influences the activity of a gene 
25 product required for proliferation, said gene product comprising a gene product whose expression is 

inhibited by an antisense nucleic acid comprising a sequence selected from the group consisting of 

SEQ ID NOs.: 1-127, said method comprising: 

contacting said gene product with a candidate compound; and 
determining whether said compound influences the activity of said gene product 
30 29. The method of Claim 28, wherein said gene product is a polypeptide and said 

activity is an enzymatic activity. 

30. The method of Claim 28, wherein said gene product is a polypeptide and said 
activity is a carbon compound catabolism activity. 

31. The method of Claim 28, wherein said gene product is a polypeptide and said 
35 activity is a biosynthetic activity. 
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32. The method of Claim 28, wherein said gene product is a polypeptide and said 
activity is a transporter activity. 

33. The method of Claim 28, wherein said gene product is a polypeptide and said 
activity is a transcriptional activity. 

5 34. The method of Claim 28, wherein said gene product is a polypeptide and said 

activity is a DNA replication activity. 

35. The method of Claim 28, wherein said gene product is a polypeptide and said 
activity is a cell division activity. 

36. A compound identified using the method of Claim 28. 

10 37. The method of Claim 28, wherein said gene product is a polypeptide comprising a 

sequence selected from the group consisting of SEQ IDNOs.: 299-469. 

38. A method for identifying a compound or nucleic acid having the ability to reduce 
the activity or level of a gene product required for proliferation, said gene product comprising a 
gene product whose activity or expression is inhibited by an antisense nucleic acid comprising a 

1 5 sequence selected from the group consisting of SEQ ID NOs.: 1 - 1 27, said method comprising: 

(a) providing a target that is a gene or RNA, wherein said target comprises a 
nucleic acid encoding said gene product; 

(b) contacting said target with a candidate compound or nucleic acid; and 

(c) measuring an activity of said target. 

20 39. The method of Claim 38, wherein said target is a messenger RNA molecule and 

said activity is translation of said messenger RNA. 

40. The method of Claim 38, wherein said target is a messenger RNA molecule and 
said activity is transcription of a gene encoding said messenger RNA. 

41. The method of Claim 38, wherein said target is a gene and said activity is 
25 transcription of said gene. 

42. The method of Claim 38, wherein said target is a nontranslated RNA and said 
activity is processing or folding of said nontranslated RNA or assembly of said nontranslated RNA 
into a protein/RNA complex. 

43. The method of Claim 38, wherein said target gene or RNA encodes a polypeptide 
30 comprising a sequence selected from the group consisting of SEQ ID NOs.: 299-469. 

44. A compound or nucleic acid identified using the method of Claim 38. 

45. A method for identifying a compound which reduces the activity or level of a gene 
product required for proliferation of a microorganism, wherein the activity or expression of said 
gene product is inhibited by an antisense nucleic acid comprising a sequence selected from the 

35 group consisting of SEQ ID NOs.: 1-127, said method comprising the steps of: 
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(a) expressing a sublethal level of an antisense nucleic acid complementary to 
a nucleic acid encoding said gene product in a cell to reduce the activity or amount of said 
gene product in said cell, thereby producing a sensitized cell; 

(b) contacting said sensitized cell with a compound; and 

5 (c) determining whether said compound inhibits the growth of said sensitized 

cell. 

46. The method of Claim 45, wherein said determining step comprises determining 
whether said compound inhibits the growth of said sensitized cell to a greater extent than said 
compound inhibits the growth of a nonsensitized cell. 
10 47. The method of Claim 45, wherein said cell is selected from the group consisting of 

bacterial cells, fungal cells, plant cells, and animal cells. 

48. The method of Claim 45, wherein said cell is a Gram negative bacterium. 
• 49. The method of Claim 45, wherein said cell is an E. coli cell. 

50. The method of Claim 45, wherein said cell is from an organism selected from the 
15 group consisting of Aspergillus fumigatus, Bacillus anthracis, Campylobacter jejuni, Candida 

albicans, Chlamydia pneumoniae, Chlamydia trachomatus, Clostridium bolulinum, Cryptococcus 
neoformans, Enterobacter cloacae, Enterococcus faecalis, Haemophilus influenzae, Helicobacter 
pylori, Klebsiella pneumoniae, Mycobacterium leprae, Mycobacterium tuberculosis, Neisseria 
gonorrhoeae, Pseudomonas aeruginosa, Saccharomyces cerevisae, Salmonella cholerasuis, 
20 Salmonella paratyphi. Salmonella typhi, Salmonella typhimurium, Staphylococcus aureus, 
Staphylococcus epidermidis, Streptococcus pneumoniae, Treponema pallidum, and Yersinia pestis 
or any species falling within the genera of any of the above species. 

51. The method of Claim 45, wherein said antisense nucleic acid is transcribed from an 
inducible promoter. 

25 52. The method of Claim 51, further comprising the step of contacting said cell with a 

concentration of inducer which induces said antisense nucleic acid to a sublethal level. 

53. The method of Claim 45, wherein growth inhibition is measured by monitoring 
optical density of a culture growth solution. 

54. The method of Claim 45, wherein said gene product is a polypeptide. 

30 55. The method of Claim 54, wherein said polypeptide comprises a sequence selected 

from the group consisting of SEQ ID NOs.: 299-469. 

56. The method of Claim 45, wherein said gene product is an RNA. 

57. A compound identified using the method of Claim 45. 

58. A method for inhibiting cellular proliferation comprising introducing a compound 
35 with activity against a gene whose activity or expression is inhibited by an antisense nucleic acid 
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comprising a sequence selected from the group consisting of SEQ ID NOs.: 1-127 or a compound 
with activity against the product of said gene into a population of cells expressing said gene. 

59. The method of Claim 58, wherein said compound is an antisense nucleic acid 
comprising a sequence selected from the group consisting of SEQ ID NOs.: 1-127, or a 

5 proliferation-inhibiting portion thereof. 

60. The method of Claim 59, wherein said proliferation inhibiting portion of one of 
SEQ ID NOs.: 1-127 is a fragment comprising at least 10, at least 20, at least 25, at least 30, at least 
50 or more than 51 consecutive bases of one of SEQ ID NOs.: 1-127. 

61 . The method of Claim 58, wherein said population is a population selected from the 
10 group consisting of bacterial cells, fungal cells, plant cells, and animal cells. 

62. The method of Claim 58, wherein said population is a population of Gram negative 
bacteria. 

63. The method of Claim 58, wherein said population is a population of £ coli cells. 

64. The method of Claim 58, wherein said population is a population selected from the 
15 group consisting of Aspergillus fumigatus, Bacillus artthracis, Campylobacter jejuni, Candida 

albicans. Chlamydia pneumoniae, Chlamydia trachomatus, Clostridium botulinum, Cryptococcus 
neoformans, Enterobacter cloacae, Enterococcus faecalis, Haemophilus influenzae, Helicobacter 
pylori, Klebsiella pneumoniae, Mycobacterium leprae, Mycobacterium tuberculosis, Neisseria 
gonorrhoeae, Pseudomonas aeruginosa,Saccharomyces cerevisae, Salmonella cholerasuis, 
20 Salmonella paratyphi, Salmonella typhi, Salmonella typhimurium, Staphylococcus aureus, 
Staphylococcus epidermidis, Streptococcus pneumoniae, Treponema pallidum, and Yersinia pestis 
cells or cells from any species falling within the genera of any of the above species. 

65. The method of Claim 58, wherein said gene encodes a polypeptide comprising a 
sequence selected from the group consisting of SEQ ID NOs.: 299-469. 

25 66. A preparation comprising an effective concentration of an antisense nucleic acid 

comprising a sequence selected from the group consisting of SEQ ID NOs.: 1-127, or a 
proliferation-inhibiting portion thereof in a pharmaceutical^ acceptable carrier. 

67. The preparation of Claim 66, wherein said proliferation-inhibiting portion of one of 
SEQ ID NOs.: 1-127 comprises at least 10, at least 20, at least 25, at least 30, at least 50 or more 

30 than 50 consecutive bases of one of SEQ ID NOs.: 1-127. 

68. A method for inhibiting the activity or expression of a gene in an operon required 
for proliferation wherein the activity or expression of at least one gene in said operon is inhibited by 
an antisense nucleic acid comprising a sequence selected from the group consisting of SEQ ID 
NOs.: 1-127, said method comprising contacting a cell in a cell population with an antisense nucleic 

35 acid comprising at least a proliferation- inhibiting portion of said operon. 
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69. The method of Claim 68, wherein said antisense nucleic acid comprises a sequence 
selected from the group consisting of SEQ ID NOs.: 1-127 or a proliferation inhibiting portion 
thereof. 

70. The method of Claim 68, wherein said cell is contacted with said antisense nucleic 
5 acid by introducing a plasmid which expresses said antisense nucleic acid into said cell population. 

71 . The method of Claim 68, wherein said cell is contacted with said antisense nucleic 
acid by introducing a phage which expresses said antisense nucleic acid into said cell population. 

72. The method of Claim 68, wherein said cell is contacted with said antisense nucleic 
acid by expressing said antisense nucleic acid from the chromosome of cells in said cell population. 

10 73. The method of Claim 68, wherein said cell is contacted with said antisense nucleic 

acid by introducing a promoter adjacent to a chromosomal copy of said antisense nucleic acid such 

that said promoter directs the synthesis of said antisense nucleic acid. 

74. The method of Claim 68, wherein said cell is contacted with said antisense nucleic 

acid by introducing a retron which expresses said antisense nucleic acid into said cell population. 
15 75. The method of Claim 68, wherein said cell is contacted with said antisense nucleic 

acid by introducing a ribozyme into said cell-population, wherein a binding portion of said 

ribozyme is complementary to said antisense oligonucleotide. 

76. The method of Claim 68, wherein said cell is contacted with said antisense nucleic 

acid by introducing a liposome comprising said antisense oligonucleotide into said cell. 
20 77. The method of Claim 68, wherein said cell is contacted with said antisense nucleic 

acid by electroporation of said antisense nucleic acid. 

78. The method of Claim 68, wherein said antisense nucleic acid is a fragment 

comprising at least 10, at least 20, at least 25, at least 30, at least 50 or more than 50 consecutive 

bases of one of SEQ ID NOs.: 1-127. 
25 79. The method of Claim 68 wherein said antisense nucleic acid is an oligonucleotide. 

80. A method for identifying a gene which is required for proliferation of a 
microorganism comprising: 

(a) contacting a microorganism other than E. coli with a nucleic acid selected 
from the group consisting of SEQ ID NOs.: 1-127; 
30 (b) determining whether said nucleic acid inhibits proliferation of said 

microorganism; and 

(c) identifying the gene in said microorganism which is inhibited by said 
nucleic acid. 

81. The method of Claim 80, wherein said microorganism is a Gram negative 
35 bacterium. 
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82. The method of Claim 80 wherein said microorganism is selected from the group 
consisting of Aspergillus fumigates, Bacillus anthracis, Campylobacter jejuni, Candida albicans, 
Chlamydia pneumoniae, Chlamydia trachomatis, Clostridium botulimim, Cryptococcus 
neoformans, Enterobacter cloacae, Enterococcus faecalis, Haemophilus influenzae, Helicobacter 
5 pylori, Klebsiella pneumoniae, Mycobacterium leprae, Mycobacterium tuberculosis, Neisseria 
gonorrhoeae, Pseudomonas aeruginosa, Saccharomyces cerevisae, Salmonella cholerasuis, 
Salmonella paratyphi, Salmonella typhi, Salmonella typhimurium, Staphylococcus aureus, 
Staphylococcus epidermidis, Streptococcus pneumoniae, Treponema pallidum, and Yersinia pestis 
or any species falling within the genera of any of the above species. 
10 83. The method of Claim 80, further comprising introducing said nucleic acid into a 

vector functional in said microorganism prior to introducing said inhibitory nucleic acid into said 
microorganism. 

84. A method for identifying a compound having the ability to inhibit proliferation of a 
microorganism comprising: 

IS (a) identifying in a first microorganism a homolog of a gene or gene product 

present in a second microorganism which is different than said first microorganism, 
wherein the activity or level of said gene or gene product is inhibited by a nucleic acid 
comprising a sequence selected from the group consisting of SEQ ID NOs. 1 - 1 27 ; 

(b) identifying an inhibitory nucleic acid sequence which inhibits the activity 
20 of said homolog in said first microorganism; 

(c) contacting said first microorganism with a sublethal level of said inhibitory 
nucleic acid, thus sensitizing said first microorganism; 

(d) contacting the sensitized microorganism of step (c) with a compound; and 

(e) determining whether said compound inhibits proliferation of said sensitized 
25 microorganism. 

85. The method of Claim 84, wherein said determining step comprises determining 
whether said compound inhibits proliferation of said sensitized microorganism to a greater extent 
than said compound inhibits proliferation of a nonsensitized microorganism. 

86. The method of Claim 84 wherein step (a) comprises identifying a homologous 
30 nucleic acid to a gene or gene product whose activity or level is inhibited by a nucleic acid selected 

from the group consisting of SEQ ID NOs. 1-127 or a nucleic acid encoding a homologous 
polypeptide to a polypeptide whose activity or level is inhibited by a nucleic acid selected from the 
group consisting of SEQ ID NOs. 1-127 by using an algorithm selected from the group consisting 
of BLASTN version 2.0 with the default parameters and FASTA version 3.0t78 algorithm with the 
35 default parameters to identify said homologous nucleic acid or said nucleic acid encoding a 
homologous polypeptide in a database. 
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87. The method of Claim 84 wherein said step (a) comprises identifying a homologous 
nucleic acid or a nucleic acid encoding a homologous polypeptide by identifying nucleic acids 
which hybridize to said first gene. 

88. The method of Claim 84 wherein the step (a) comprises expressing a nucleic acid 
5 selected from the group consisting of SEQ ID NOs. 1-127 in said microorganism. 

89. The method of Claim 84, wherein said inhibitory nucleic acid is an antisense 
nucleic acid. 

90. The method of Claim 84, wherein said inhibitory nucleic acid comprises an 
antisense nucleic acid to a portion of said homolog. 

10 91. The method of Claim 84, wherein said inhibitory nucleic acid comprises an 

antisense nucleic acid to a portion of the operon encoding said homolog. 

92. The method of Claim 84, wherein the step of contacting the first microorganism 
with a sublethal level of said inhibitory nucleic acid comprises directly contacting said 
microorganism with said inhibitory nucleic acid. 
15 93. The method of Claim 84, wherein the step of contacting the first microorganism 

with a sublethal level of said inhibitory nucleic acid comprises expressing an antisense nucleic acid 
to said homolog in said microorganism. 

94. The method of Claim 84, wherein said gene product comprises a polypeptide 
comprising a sequence selected from the group consisting of SEQ ID NOs.: 299-469. 
20 95. A compound identified using the method of Claim 84. 

96. A method of identifying a compound having the ability to inhibit proliferation 
comprising: 

(a) contacting a microorganism other than £. coli with a sublethal level of a 
nucleic acid comprising a sequence selected from the group consisting of SEQ ID NOs. 1- 

25 127 or a portion thereof which inhibits the proliferation of E. coli, thus sensitizing said 

microorganism; 

(b) contacting the sensitized microorganism of step (a) with a compound; and 

(c) determining whether said compound inhibits proliferation of said sensitized 
microorganism. 

30 97. The method of Claim 96, wherein said determining step comprises determining 

whether said compound inhibits proliferation of said sensitized microorganism to a greater extent 
than said compound inhibits proliferation of a nonsensitized microorganism. 

98. A compound identified using the method of Claim 96. 

99. A method for identifying a compound having activity against a biological pathway 
35 required for proliferation comprising: 
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(a) sensitizing a cell by expressing a sublethal level of an antisense nucleic 
acid complementary to a nucleic acid encoding a gene product required for proliferation, 
wherein the activity or expression of said gene product is inhibited by an antisense nucleic 
acid comprising a sequence selected from the group consisting of SEQ ID NOs.: 1-127, in 

5 said cell to reduce the activity or amount of said gene product; 

(b) contacting the sensitized cell with a compound; and 

(c) determining whether said compound inhibits the growth of said sensitized 

cell. 

100. The method of Claim 99, wherein said determining step comprises determining 
10 whether said compound inhibits the growth of said sensitized cell to a greater extent than said 

compound inhibits the growth of a nonsensitized cell. 

101 . The method of Claim 99, wherein said cell is selected from the group consisting of 
bacterial cells, fungal cells, plant cells, and animal cells. 

102. The method of Claim 99, wherein said cell is a Gram negative bacterium. 
15 103. The method of Claim 99, wherein said Gram negative bacterium is E. colL 

104. The method of Claim 99, wherein said cell is selected from the group consisting of 
Aspergillus fumigatus, Bacillus anthracis, Campylobacter jejuni, Candida albicans, Chlamydia 
pneumoniae, Chlamydia trachomatus, Clostridium botulinum, Cryptococcus neoformans, 
Enterobacter cloacae, Enterococcus faecalis, Haemophilus influenzae, Helicobacter pylori, 

20 Klebsiella pneumoniae, Mycobacterium leprae, Mycobacterium tuberculosis, Neisseria 
gonorrhoeae, Pseudomonas aeruginosa, Saccharomyces cerevisae, Salmonella cholerasuis, 
Salmonella paratyphi, Salmonella typhi, Salmonella typhimurium, Staphylococcus aureus, 
Staphylococcus epidermidis, Streptococcus pneumoniae, Treponema pallidum, and Yersinia pestis 
or any species falling within the genera of any of the above species. 

25 1 05. The method of Claim 99, wherein said antisense nucleic acid is transcribed from an 

inducible promoter. 

106. The method of Claim 99, further comprising contacting the cell with an agent 
which induces expression of said antisense nucleic acid from said inducible promoter, wherein said 
antisense nucleic acid is expressed at a sublethal level. 
30 107. The method of Claim 99, wherein inhibition of proliferation is measured by 

monitoring the optical density of a liquid culture. 

108. The method of Claim 99, wherein said gene product comprises a polypeptide 
comprising a sequence selected from the group consisting of SEQ ID NOs.: 299-469 

109. A compound identified using the method of Claim 99. 

35 110. A method for identifying a compound having the ability to inhibit cellular 

proliferation comprising: 
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(a) contacting a cell with an agent which reduces the activity or level of a gene 
product required for proliferation of said cell, wherein said gene product is a gene product 
whose activity or expression is inhibited by an antisense nucleic acid comprising a 
sequence selected from the group consisting of SEQ ID NOs. : 1 - 1 27; 
5 (b) contacting said cell with a compound; and 

(c) determining whether said compound reduces proliferation of said contacted 

cell. 

111. The method of Claim 1 10, wherein said determining step comprises determining 
whether said compound reduces proliferation of said contacted cell to a greater extent than said 

10 compound reduces proliferation of cells which have not been contacted with said agent. 

1 12. The method of Claim 1 10, wherein said agent which reduces the activity or level of 
a gene product required for proliferation of said cell comprises an antisense nucleic acid to a gene 
or operon required for proliferation. 

113. The method of Claim 1 10, wherein said agent which reduces the activity or level of 
15 a gene product required for proliferation of said cell comprises a compound known to inhibit 

growth or proliferation of a microorganism. 

1 14. The method of Claim 1 10, wherein said cell contains a mutation which reduces the 
activity or level of said gene product required for proliferation of said cell. 

115. The method of Claim 114, wherein said mutation is a temperature sensitive 
20 mutation. 

116. The method of Claim 110, wherein said gene product comprises a polypeptide 
comprising a sequence selected from the group consisting of SEQ ID NOs.: 299-469 

117. A compound identified using the method of Claim 1 10. 

118. A method for identifying the biological pathway in which a proliferation-required 
25 gene or its gene product lies, wherein said gene or gene product comprises a gene or gene product 

whose activity or expression is inhibited by an antisense nucleic acid comprising a sequence 
selected from the group consisting of SEQ ID NOs.: 1-127, said method comprising: 

(a) expressing a sublethal level of an antisense nucleic acid which inhibits the 
activity of said proliferation-required gene or gene product in a cell; 
30 (b) contacting said cell with a compound known to inhibit . growth or 

proliferation of a microorganism, wherein the biological pathway on which said compound 
acts is known; and 

(c) determining whether said cell is sensitive to said compound. 

119. The method of Claim 118, wherein said determining step comprises determining 
35 whether said cell has a substantially greater sensitivity to said compound than a cell which does not 

express said sublethal level of said antisense nucleic acid and wherein said gene or gene product 
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lies in the same pathway on which said compound acts if said cell expressing said sublethal level of 
said antisense nucleic acid has a substantially greater sensitivity to said compound than said cell 
which does not express said sublethal level of said antisense nucleic acid. 

120. The method of Claim 118, wherein said gene product comprises a polypeptide 
5 comprising a sequence selected from the group consisting of SEQ ID NOs.: 299-469 

121. A method for determining the biological pathway on which a test compound acts 
comprising: 

(a) expressing a sublethal level of an antisense nucleic acid complementary to 
a proliferation-required nucleic acid in a cell, wherein the activity or expression of said 

10 proliferation-required nucleic acid is inhibited by an antisense nucleic acid comprising a 

sequence selected from the group consisting of SEQ ID NOs.: 1-127 and wherein the 
biological pathway in which said proliferation-required nucleic acid or a protein encoded 
by said proliferation-required polypeptide lies is known, 

(b) contacting said cell with said test compound; and 

15 (c) determining whether said cell is sensitive to said test compound. 

122. The method of Claim 121, wherein said determining step comprises determining 
whether said cell has a substantially greater sensitivity to said test compound than a cell which does 
not express said sublethal level of said antisense nucleic acid. 

1 23 . The method of Claim 121, further comprising: 

20 (d) expressing a sublethal level of a second antisense nucleic acid 

complementary to a second proliferation-required nucleic acid in a second cell, wherein 
said second proliferation-required nucleic acid is in a different biological pathway than said 
proliferation-required nucleic acid in step (a); and 

(e) determining whether said second cell does not have a substantially greater 

25 sensitivity to said test compound than a cell which does not express said sublethal level of 

said second antisense nucleic acid, wherein said test compound is specific for the biological 
pathway against which the antisense nucleic acid of step (a) acts if said second cell does not 
have substantially greater sensitivity to said test compound. 

124. A purified or isolated nucleic acid comprising a sequence selected from the group 
30 consisting of SEQ ID NOs.: 1-127. 

125. A compound which interacts with a gene or gene product whose activity or 
expression is inhibited by an antisense nucleic acid comprising one of SEQ ID NOs.: 1-127 to 
inhibit proliferation. 

126. A compound which interacts with a polypeptide whose expression is inhibited by 
35 an antisense nucleic acid comprising one of SEQ ID NOs.: 1 - 1 27 to inhibit proliferation. 

127. A method for manufacturing an antibiotic comprising the steps of: 
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screening one or more candidate compounds to identify a compound that reduces 
the activity or level of a gene product required for proliferation, said gene product 
comprising a gene product whose activity or expression is inhibited by an antisense nucleic 
acid comprising a sequence selected from the group consisting of SEQ ID NOs.: 1-127; and 
5 manufacturing the compound so identified. 

128. The method of Claim 127, wherein said screening step comprises performing any 
one of the methods of Claims 28, 38, 45, 96, 99 and 1 10. 

129. A method for inhibiting proliferation of a microorganism in a subject comprising 
administering a compound that reduces the activity or level of a gene product required for 

10 proliferation of said microorganism, said gene product comprising a gene product whose activity or 
expression is inhibited by an antisense nucleic acid comprising a sequence selected from the group 
consisting of SEQ ID NOs.: 1-127 to said subject. 

130. The method of Claim 129 wherein said subject is selected from the group 
consisting of vertebrates, mammals, avians, and human beings. 

15 131. The method of Claim 129, wherein said gene product comprises a polypeptide 

comprising a sequence selected from the group consisting of SEQ ID NOs.: 299-469. 
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SEQUENCE LISTING 



<110> ELITRA PHARMACEUTICALS, INC. 
FORSYTH, R. Allyn 
OHLSEN, Kari 
ZYSKIND, Judith 



<120> GENES ESSENTIAL FOR MICROBIAL PROLIFERATION AND ANTISENSE THERETO 
<130> ELITRA. 008VPC 

<150> US 60/164,415 
<151> 1999-11-09 

<160> 469 

<170> FastSEQ for Windows Version 4.0 

<210> 1 
<211> 341 
<212> DNA 

<213> Escherichia coli 
<400> 1 

atttataaat aagcattaac cctcattaat gcctgctact gctgattttt tccccgcgac 
atgccgtgtc gcggggattt ttttatccgg gctttacagc gtttgtgggc tcacttgatg 
acggcggaca tccaccggca tcccggaacg gacatccatc acatgttgca tcacttcagc 
gtcagtttgt gctgcatctt taaatgattg cattgcgcta ttcagcgtaa ttggcagcag 
ctgcggatcg tcatcaatct tctctgacag cggctgatgt acttcaacca gacgcgcacc 
gtttggttcg gcagagactt ttatcggagt gttgatgata t 

<210> 2 
<211> 190 
<212> DNA 

<213> Escherichia coli 
<400> 2 

aatgcctgct actgctgatt ttttccccgc gacatgccgg gtcgcgggga tttttttatc 
cgggctttac agcgtttgtg ggctcacttg atgacggcgg acatccaccg gcatcccgga 
acggacatcc atcacatgtt gcatcacttc agcgtcagtt tgtgctgcat ctttaaatga 
tagcattgcg 

<210> 3 
<211> 389 
<212> DNA 

<213> Escherichia coli 
<220> 

<221> misc_feature 
<222> (1) . . . (389) 
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<223> n = A,T,C or G 



<400> 3 

accctcatta atgcctgcta ctgctgattt tttccccgcg acatgccgtg tcgcggggat 60 

ttttttatcc gggctttaca gcgtttgtgg gctcacttga tgacggcgga catccaccgg 120 

catcccggaa cggacatcca tcacatgttg catcacttca gcgtcagttt gtgctgcatc 180 

tttaaatgat tgcattgcgc tattcagcgt aattggcagc agctgcggat cgtcatcaat 24 0 

cttctctgac agcggctgat gtacttcaac canacgcgca ccgtttggtt cggcagagac 300 

ttttatcgga gtgttgatga tattcacttt ggtgcctggg gtgacctggc taaanagtgt 360 

tttgatatcg tcatcccgca aacaaatac 389 



<210> 4 
<211> 382 
<212> DNA 

<213> Escherichia coli 
<220> 

<221> misc_feature 
<222> (1) . . . (382) 
<223> n = A, T,C or G 



<400> 4 

gtgagaaatt tttcagctga agagtaatca tcttgatggg gtttgtaatc aacattctga 60 

taaatatcat taatatattg tgcagcaatg tgtcggcata ctatacggtt ttggggtcgc 120 

cccggaaaat aacattgacc attaagttta attcttggga attctttttc ttttctatct 180 

gattttaggt ttaattcttg tttatcaaga gtgtctgagc gttgaggtaa aatttccatt 240 

aggtattttt caaatattgg ggcaatggtt tgagtaagtg tcgtgttttc gtttttatta 300 

attatcgtaa tttctttctt atcagcttgt aagacggacg tggaaaattt aatgacaaaa 360 

tccttgntgc taaagatatg ac 382 



<210> 5 
<211> 199 
<212> DNA 

<213> Escherichia coli 



<400> 5 

ataacattga ccattaagtt taattcttgg gaattctttt tcttttctat ctgattttag 60 

gtttaattct tgtttatcaa gagtgtctga gcgttgaggt aaaatttcca ttaggtattt 120 

ttcaaatatt ggggcaatgg tttgagtaag tgtcgtgttt tcgtttttat taattatcgt 180 

aatttctttc ttatcagct 199 



<210> 6 
<211> 338 
<212> DNA 

<213> Escherichia coli 



<400> 6 

gtctggttct gaccgataga acccggaacg cggtgagaca aggagttacc gtgagtagcg 60 

tcctgggtac ggaagttcca gcgcttaacg gtacctgcga aacctttacc tttagaggtg 120 

ccagttacgt caactttttt aacgtcagca aacagttcaa cgctaatgct ctgacctaca 180 

gtgaactctt cgccttcagc caggcggaat tcccacagac cacggccagc ttctacgcca 240 

gctttagcga agtggccagc ttcaggcttg gtcacacggt tagctttttt agcaccggtg 300 

gtcacctgaa tagcacggta gccatcgtta gccaggtc 338 

<210> 7 
<211> 839 
<212> DNA 

<213> Escherichia coli 



<220> 



-2- 



WO 01/34810 



PCT/US00/30950 



<221> misc_feature 
<222> (1)...(839) 
<223> n « A, T,C or G 

<400> 7 

naccangant gggacccaaa tgaaaattga actgaagatt acaatcaatt taacancaag 60 

ttacacaacc aattcagact ccgccagccc accaatcatg attggacggt gtaaggacaa 120 

caccaacaaa aacaggaagt tagaagtctn agcaaaacac cgaccagacg gtgaggagac 180 

ataaaaggat acgcaaagga gccgcggctc ctggtaacat gaaagcccac agatgtgggc 240 

tttttcgttg atggtcagaa cgaccagttc acaccagcta ccgcgttcca cggggattcc 300 

acaccggcac tatggctata ccccacccca agatgcccgc ttaacgtact gctgaatgag 360 

gctttaatac ctgcctggta tattccacgt ctgcccgaca aataattgac gaaattaccg 420 

ncactattca ctttcacccg gttatcatcg acaaattctt tgcgcacaag ccgncttcag 4 80 

ccacggntca acttccatac cgntccccag acgcatgttg taactcaacg ttgcgcccag 540 

ttcacgatat atactgcggg tatcgactga tttcgatttc atgccattgg ataaatgata 600 

ttcggggtta tcagcggtga aacccgttna cgatgcatac ngggtcaggg ttcaattacc 660 

atcggtaaat cgcattcccg gtttcaatgt gaccgccaan cccgttgctg tnggtaactg 720 

gcattggcgg gttcancggt ggtaatttta acctggtacg ntacttttaa aacgggttaa 780 

gctttaccga anaccgtnca nnattggaaa cccnctttta atggttnccc aactnggnn 839 

<210> 8 
<211> 196 
<212> DNA 

<213> Escherichia coli 
<400> 8 

cctctacctt agaactgaag gccagcttca cgggcagcat ctgccagtgc ctggacacga 60 
ccatgatatt ggaacccgga acggtcaaag gatacatctt tgatgccttt ttccagagcg 120 
cgttcagcga cagctttacc cacagctgca gccgcgtctt tgttaccggt gtacttcagt 180 
tgttcagcga tagctt 196 

<210> 9 
<211> 472 
<212> DNA 

<213> Escherichia coli 
<400> 9 

attacctgtg cgtaaatgtg acgcggggta cgatgtacca ccaggcgagt tgcgcccagc 60 
tcctggagct tgcggcgtgc gcgggtcgca cgacggatac gagcagattt cttatccata 120 
gtgttacctt acttcttctt agcctctttg gtacgcacga cttcgtcggc gtaacgaaca 180 
cccttgcctt tataaggctc aggacgacgg taggcgcgca gatccgctgc aacctggccg 240 
atcacctgct tatcagcgcc tttcagcacg atttcagtct gagtcggaca ttcagcagtg 300 
atacccgcag gcagctgatg gtcaacagga tgagagaaac ccagagacag gttaatcaca 360 
ttgcctttaa ccgctgcacg gtaacctaca ccaaccagct gcagcttctt agtgaagcct 420 
tcggtaacac cgataaccat tgagttcagc agggcacgcg cggtaccagc ct 472 

<210> 10 
<211> 165 
<212> DNA 

<213> Escherichia coli . 
<400> 10 

cggcatttca cagcattact tggtgatacg agtctgcgcg tctttcaggg cttcatcgac 60 
agtctgacga ccgctggcgg cgttgatcac cgcagtacgc acggcatacc agaaagcgga 120 
catctgcggg atgttcggca tgatttcacc tttctgggcg ttttc 165 

<210> 11 
<211> 328 
<212> DNA 

<213> Escherichia coli 
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<220> 

<221> misc_feature 
<222> (1) . . . (328) 
<223> n = A,T,C or G 

<400> 11 

cgcgtggaga gtcatcagtt gcacctgatc cagctcttct tcactctcac cacgctcnat 60 

catgtcgcgc aaagtaaagc gcgtcaccac ctgggtgagc gtcatcggct catccagttc 120 

actgccttcc agcatctccg tcatccagct aaacagttgg ttgacgttct tcatgcgcat 180 

ttcggcggct ttcgggctgg gcgatgtttc gtacagccag gattcataat ccatgccatg 24 0 

gatcagatca cgcaccgcgg caatcggctc ccgctccgcc agacgctgga tttctgccaa 300 

ccagtgagtg aagcgggtca atgcttca » 328 

<210> 12 
<211> 332 
<212> DNA 

<213> Escherichia coli 
<4O0> 12 

cacgcaatga gttgaatatg acgcatttta tttgtagttt attatttttt ttatttacta 60 
ttttctgtag cgaaactaat atccagtgtc cgccagtatt tatggggaat aactggactt 120 
cattcggcca tgatggatcg ctcttaatat tttcattgag agcgtcttcc aggtgagttc 180 
tggtggagta ggggacgtag agaccagtag gataattgac ttgagtattt tttgcattag 240 
cggcgatctt ctgaagatag caagctattt catctgtggt taattgacga tttttatccc 300 
agaagtttgt accatcactc agttcattat tc 332 

<210> 13 
<211> 132 
<212> DNA 

<213> Escherichia coli 
<400> 13 

ccgataattg tcattaattt cgaaggggag aatagttttt tgaacaggag atcatctgaa 60 
tcatgtgatt tgagggggga gttggttgcg atatgtgcca gattatataa ttttaaacaa 120 
gcatttttga ta 132 

<210> 14 
<211> 265 
<212> DNA 

<213> Escherichia coli 
<400> 14 

cccatgagtt cagaattaaa actattaatg acacgcttaa gtgaattttc tttatcatag 60 
cctagtgcac gcaatgagtt gaatatgacg cattttattt gtagtttatt atttttttta 120 
tttactattt tctgtagcga aactaatatc cagtgtccgc cagtatttat ggggaataac 180 
tggacttcat tcggccatga tggatcgctc ttaatatttt cattgagagc gtcttccagg 240 
tgagttctgg tggagtaggg gacgt 265 

<210> 15 
<211> 665 
<212> DNA 

<213> Escherichia coli 
<220> 

<221> misc_feature 
<222> (1) . . . (665) 
<223> n = A, T,C or G 

<400> 15 
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ctgtcagcat ggttttttaa taacgtatag ggggcgctgt cagaattgct ttccctttgt 60 

tcaatcactt cctgggctgc catgcacaca aatgcaccgc aactttggct taagtattgc 120 

tgaagatcgg catgtaaaaa tattatctct ggttcattta aatgaacttt tatattatta 180 

ttcgacattt ctcccatgag ttcagaatta aaactattaa tgacacgctt aagtgaattt 240 

tctttatcat agcctagtgc acgcaatgag ttgaatatga cgcattttat ttgtagttta 300 

ttattttttt tatttactat tttctgtagc gaaactaata tccagtgtcc gccagtattt 360 

atggggaata actggacttc attcggccat gatggatcgc tcttaatatt ttcattgaga 420 

gcgtcttcca ggtgagttct ggtggagtag gggacgtaga gaccagtaag ataattgact 480 

tgagtatttt ttgcattagc ggcgatcttc tgaagatagc aagctatttc atctgtggtt 540 

aattgacgat ttttatcccc agaagtttgt accatcactc agtcattatt ctcatcattt 600 

aaaagccctt tgaaagcgct ggttttcaaa tataagaaga ggggatccgc gaccctgcag 660 

ccaan 665 

<210> 16 
<211> 130 
<212> DNA 

<213> Escherichia coli 
<400> 16 

agaggaattt catcaccgat aattgtcatt aatttcgaag gggagaatag ttttttgaac 60 
aggagatcat ctgaatcatg tgatttgagg ggggagttgg ttgcgatatg tgccagatta 120 
tataatttta 130 

<210> 17 
<211> 246 
<212> DNA 

<213> Escherichia coli 
<220> 

<221> misc_feature 
<222> (1) . . . (246) 
<223> n = A,T,C or G 

<400> 17 

cttcctgggc tgccatgcac acaaatgcac cgcaactttg gcttaagtat tgctgaagat 60 

cggcatgtaa aaatattatc tctggttcat ttaaatgaac ttttatatta ttattcgaca 120 

tttctcccat gagttcagaa ttaaaactat taatgacacg cttaagtgaa ttttctttat 180 

catagcctag tgcacgcnnt nnnttgaata tgacgcattt tatttgtagn tnattattnt 240 

ttttat " 246 

<210> 18 
<211> 208 
<212> DNA 

<213> Escherichia coli 
<400> 18 

aatgacacgc ttaagtgaat tttctttatc atagcctagt gcacgcaatg agttgaatat 60 
gacgcatttt atttgtagtt tattattttt tttatttact attttctgta gcgaaactaa 120 
tatccagtgt ccgccagtat ttatggggaa taactggact tcattcggcc atgatggatc 180 
gctcttaata ttttcattga gagcgtct 208 

<210> 19 
<211> 159 
<212> DNA 

<213> Escherichia coli 
<400> 19 

tatgtaagcg gcaaacgaat ggggtaaaca tgccgatttt ttagtaaata tttcgagagg 60 
gatatgtttc taatgctaag aaaaaaggtg ccgtagcacc tttttaatag agaggttttg 120 
ttaccacaca gcagccagca gcgtatgcga gtccggtac 159 
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<210> 20 
<211> 294 
<212> DNA 

<213> Escherichia coli 
<220> 

<221> misc_feature 
<222> (1) . . . (294) 
<223> n = A, T,C or G 

<400> 20 

cggtttcaat tttcaggaac atccacgggg ganggtaaac gaatacgtgg gnaattttgg 60 

tggattttca tttgcacttg nctcttcgat cgccttaaca aaatgctgct tgccttgctt 120 

gcgccggtgc gcttctacng gattgnccgg cgcgatagaa aatcgcttgc ctaagcccgn 180 

cccctgcaca acctggggtn tattcacttg cgccaggttt tcttgggcgt cacgcccgcc 240 

aaccggcaaa agactggcga tgtcccgatg gcaatacccg cttttaacgc tttt 294 

<210> 21 
<211> 129 
<212> DNA 

<213> Escherichia coli 
<400> 21 

tcgggttcaa ttttcaagga acatccagcg gggaggtaaa cgatacgtgg taattttggt 60 
gattttcatt gcactgcctc tcgatacgcc ttaacaaatg ctgctgcctg ctgcgcggtg 120 
cgctctacg 129 

<210> 22 
<211> 151 
<212> DNA 

<213> Escherichia coli 
<400> 22 

ctcctgtatc tatattctaa ttaaaaagtt aaaaacgtta acggcttatg cgtaccgcag 60 
gcacgcgatt accaggccca gatcgccttt ggaaaccatg gctttcggac gcaggtgacg 120 
tttacgtttg gtcgcttttt tggtcagaat g 151 

<210> 23 
<211> 579 
<212> DNA 

<213> Escherichia coli 
<220> 

<221> misc_feature 
<222> (1) . . . (579) 
<223> n = A,T,C or G 

<400> 23 

ccaggatctt acggtcgatt tcaacagagg cttttttcag gccattgatg aatttgctgt 60 

aagaaatacc gttctgacgt gctgctgcgt tgatacgcgc aatccacagt tgacggaact 120 

gacgcttacg ttgacgacgg tcacggtaag catactgacc agctttgata acagcctgga 180 

aggcaacgcg gtatacgcga gaacgcgcac cgtagtagcc tttagcttgt ttcaaaattt 240 

tcttgtgacg tgcacgtgca ataacaccac gttttacgcg agccatatgt gctctcctgt 300 

atctatattc taattaaaaa gttaaaaacg ttaacggctt atgcgtacgg caggcacgcg 360 

attaccaggc ccagatcgcc tttggaaacc atggctttcg gacgcaggtg acgtttacgt 420 

ttggtcgctt ttttggtcag aatgtgacgc aggttagcgt gcttgtgctt aaaaccacct 480 

ttaccggttt ttttgaagcg cttagcagca ccgcgtacgg tcttaatttt tggcatttta 540 

ataacttnca cttcgcattg gtaataaacg aaacaaagg 579 
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<210> 24 
<211> 222 
<212> DNA 

<213> Escherichia coli 
<220> 

<221> misc_feature 
<222> (1) . . . (222) 
<223> n = A,T, C or G 

<400> 24 

gagggtgcaa caggggtgag tagaatggtt ctttattttc atggtgtang gccttctgtn 60 

gttngaggac agttttaata aagttacaat caccgcgata aacgtaacca atttttagca 120 

actaaacagg ggaaaacaat tacagatttt tatctltcga ttacgatttt tggtttattt 180 

cttgatttat gaccgagatc ttacttttgt tgcgcaattg ta 222 

<210> 25 
<211> 278 
<212> DNA 

<213> Escherichia coli 
<400> 25 

ggagagtttg cagtcattta aaaggtaaca tgacaatgca tgatgaatat aacaacatat 60 
gatgttatgt aatgtgggag gaaagatcac aaaatttcgc acaggatcgc gctgtggcta 120 
atggatgtag ttatcaaatt gaatttaaag tgaaaatatt tttacgggcg ggggcaagaa 180 
ggacatataa acaaatacgc cctcggaaaa tccagagggc gtcgggcaat taaaccggtg 240 
ttagccgatt tctgtcagag acttactgtg cagtagga 278 

<210> 26 
<211> 156 
<212> DNA 

<213> Escherichia coli 
<220> 

<221> misc_feature 
<222> (1) . . . (156) 
<223> n = A,T,C or G 

<400> 26 

tgaaatncga ccgaaggtag tttcggaaac agtcagcgcg ctctgcgcgt ctttcaatac 60 
taattccatt gctatctcct tacgccttca cagctggttt aacgatcagg tcgctaccgg 120 
ttgcacccgg gacagcacct ttaaccagca gcaggt 

<210> 27 
<211> 143 
<212> DNA 

<213> Escherichia coli 
<220> 

<221> misc_feature • 
<222> (1) . . . (143) 
<223> n « A,T,C or G . 

<400> 27 

gcgcgtcttt caatactaat tccattgcta tctccttacg ccttcacagc tggtttaacg 
atcaggtcgc tccggntgcc ccgggacagc acctttaacc agcagcaggt tgcgctcagc 
gtcaacgcgt actacgtcaa ggc 

<210> 28 
<211> 266 



60 
120 
143 
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<212> DNA 

<213> Escherichia coli 
<400> 28 

gcttcgttga aatcacgacc gaaggtagtt tcggaaacag tcagcgcgct ctgcgcgtct 60 
ttcaatacta attccattgc tatctcctta cgccttcaca gctggtttaa cgatcaggtc 120 
gctaccggtt gcacccggga cagcaccttt aaccagcagc aggttgcgct cagcgtcaac 180 
gcgtactacg tcaaggctct gaacggttac acgttcgtta cccatctgac ctgccatttt 24 0 

cttgcctttg aacactttgc ccggag 266 

<210> 29 
<211> 535 
<212> DNA 

<213> Escherichia coli 
<400> 29 

gcataagcaa caacaacctg gtgaaccagc gcttcgttga aatcacgacc gaaggtagtt 60 
tcggaaacag tcagcgcgct ctgcgcgtct ttcaatacta attccattgc tatctcctta 120 
cgccttcaca gctggtttaa cgatcaggtc gctaccggtt gcacccggga cagcaccttt 180 
aaccagcagc aggttgcgct cagcgtcaac gcgtactacg tcaaggctct gaacggttac 24 0 

acgttcgtta cccatctgac ctgccatttt cttgcctttg aacactttgc ccggagtctg 300 
gttctgaccg atagaacccg gaacgcggtg agacaaggag ttaccgtgag tagcgtcctg 360 
ggtacggaag ttccagcgct taacggtacc tgcgaaacct ttacctttag aggtgccagt 420 
tacgtcaact tttttaacgt cagcaaacag ttcaacgcta atgctctgac ctacagtgaa 4 80 

ctcttcgcct tcagccaggc ggaattccca cagaccacgg ccagcttcta cgcca 535 

<210> 30 
<211> 223 
<212> DNA 

<213> Escherichia coli 
<400> 30 

ttcaacgcta atgctctgac ctacagtgaa ctcttcgcct tcagccaggc ggaattccca 60 
cagaccacgg ccagcttcta cgccagcttt agcgaagtgg ccagcttcag gcttggtcac 120 
acggttagct tttttagcac cggtggtcac ctgaatagca cggtagccat cgttagccag 180 
gtctttaacc tgagtaacgc ggtttgcttc aacttcgatt acg 223 

<210> 31 
<211> 225 
<212> DNA 

<213> Escherichia coli 
<220> 

<221> misc_£eature 
<222> (1) . . . (225) 
<223> n = A,T,C or G 

<400> 31 

cgctctgcgc gtctttcaat actaattcca ttgctatctc cttacgcctt cacagctggt 60 

ttaacgatca ggtcgctacc ggntgcaccc gggacagcac ctttaaccaa cagcangntg 120 

cgctcaacgt caacgcgtac tacgtcaagg ctctgaacgg gtacacgttc gttacccatn 180 

tgacctgnca ttttcttgcc tttgaacact ttgcccggag tctgg 225 

<210> 32 
<211> 341 
<212> DNA 

<213> Escherichia coli 
<400> 32 

tgctatctcc ttacgccttc acagctggtt taacgatcag gtcgctaccg gttgcacccg 60 
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ggacagcacc tttaaccagc agcaggttgc gctcagcgtc aacgcgtact acgtcaaggc 120 

tctgaacggt tacacgttcg ttacccatct gacctgccat tttcttgcct ttgaacactt 180 

tgcccggagt ctggttctga ccgatagaac ccggaacgcg gtgagacaag gagttaccgt 240 

gagtagcgtc ctgggtacgg aagttccagc gcttaacggt acctgcgaaa cctttacctt 300 

tagaggtgcc agttacgtca acttttttaa cgtcagcaaa c 341 



<210> 33 
<211> 176 
<212> DNA 

<213> Escherichia coli 



<400> 33 

aacaacaacc tggtgaacca gcgcttcgtt gaaatcacga ccgaaggtag tttcggaaac 60 

agtcagcgcg ctctgcgcgt ctttcaatac taattccatt gctatctcct tacgccttca 120 

cagctggttt aacgatcagg tcgctaccgg ttgcacccgg gacagcacct ttaacc 17 6 

<210> 34 
<211> 241 
<212> DNA 

<213> Escherichia coli 



60 



<400> 34 

tcgttgaaat cacgaccgaa ggtagtttcg gaaacagtca gcgcgctctg cgcgtctttc 

aatactaatt ccattgctat ctccttacgc cttcacagct ggtttaacga tcaggtcgct 120 

accggttgca cccgggacag cacctttaac cagcagcagg ttgcgctcag cgtcaacgcg 180 

tactacgtca aggctctgaa cggttacacg ttcgttaccc atctgacctg ccattttctt 24 0 

g 241 



<210> 35 

<211> 224 

<212> DNA 

<213> Escherichia coli 
<220> 

- <221> misc__feature 

<222> (1) . . . (224) 

<223> n = A, T, C or G 



<400> 35 

tgntctaatc tgattacgaa aaagatatgt tgcgggaggc gttgcctccc caacatataa 60 

gtggctccct caagccactt cctttagaag cacaaccttg cttctaacta tataaacctt 120 

ctgttatata ttacccttta tttttggggg cgtctcaacg ccccattttt aataattttt 180 

agtaaacaat tggcatatta attagagtta ttaacaacga tatc 224 

<210> 36 
<211> 413 
<212> DNA 

<213> Escherichia coli 



<400> 36 

aacatcctct tgtgtgaata aaacaaccgg accccatcga ggaacggagt ccggtgtcat 60 

attaaaagcc cgaaaatttt actcattttt gcgggaattg caatcaacag ttgctaactc 120 

tgctgtaaaa ggccgtcggc ggtgcagcca gtttggtgcc ggagtgcgcg cagtcaccgg 180 

agcgtacacg cagtacgtga ggatgacgag cacatcccgg tgccaaaatg gcaaacaagc 240 

caggccgatt agcgaccagg gaagcctggg ggcatcatac ccttcatgct tctcatcatc 300 

ttcgccattc cgcccttctt cattttcttc atcatgcgct gcatgtcgtc gaactgtttc 360 

agaagacggt taacgtcctg cacctgcata ccgcaaccgg cagcaatacg gcg 413 



<210> 37 
<211> 509 
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<212> DNA 

<213> Escherichia coli 
<400> 37 

cagcgacaac aacctggtag aacggacgct ttttagcgcc gtgacgtgct aaacgaatag 60 

ttaccataac atcctcttgt gtgaataaaa caaccggacc ccatcgagga acggagtccg 120 

gtgtcatatt aaaagcccga aaattttact catttttgcg ggaattgcaa tcaacagttg 180 

ctaactctgc tgtaaaaggc cgtcggcggt gcagccagtt tggtgccgga gtgcgcgcag 240 

tcaccggagc gtacacgcag tacgtgagga tgacgagcac atcccggtgc caaaatggca 300 

aacaagccag gccgattagc gaccagggaa gcctgggggc atcataccct tcatgcttct 360 

catcatcttc gccattccgc ccttcttcat tttcttcatc atgcgctgca tgtcgtcgaa 420 

ctgtttcaga agacggttaa cgtcctgcac ctgcataccg caaccggcag caatacggcg 480 

tttacgcgaa cctttgatga tttctggct 509 

<210> 38 
<211> 149 
<212> DNA 

<213> Escherichia coli 
<400> 38 

gatttttacc ggattaattt taaacatggt ctgatttctt ttattgagct agtcaaaatg 60 
cggtgataag agcgggattg tcccaatcca cgctcttttt tatagagaag atgacgctaa 120 
attggccaga tattgtcgat gataatttg 149 

<210> 39 
<211> 118 
<212> DNA 

<213> Escherichia coli 
<400> 39 

gtttcatccc tggcaagcag aacggctttt tcgttattta tatcgggaga attaatatga 60 
aactgcaata aattctttaa tgtgatacaa gcctgggcgt tcttttttcc ttcaattg 118 

<210> 40 
<211> 147 
<212> DNA 

<213> Escherichia coli 
<400> 40 

gcagaacggc tttttcgtta tttatatcgg gagaattaat atgaaactgc aataaattct 60 
ttaatgtgat acaagcctgg gcgttctttt ttccttcaat tgcgacaaat gtaatacttt 120 
ggttaatgat tggttggtcg tttacat 147 

<210> 41 
<211> 224 
<212> DNA 

<213> Escherichia coli 
<400> 41 

caattgccga gcgtttcatc cctggcaagc agaacggctt tttcgttatt tatatcggga 60 
gaattaatat gaaactgcaa taaattcttt aatgtgatac aagcctgggc gttctttttt 120 
ccttcaattg cgacaaatgt aatactttgg ttaatgattg gttggtcgtt tacataaaca 180 
ctgacgtcgt aaacacccgg tatagcgggg ttaccttcac tgta 224 

<210> 42 

<211> 506 

<212> DNA 

<213> Escherichia coli 

<400> 42 
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tgaggcaatt gccgagcgtt tcatccctgg caagcagaac ggctttttcg ttatttatat 60 

cgggagaatt aatatgaaac tgcaataaat tctttaatgt gatacaagcc tgggcgttct 120 

tttttccttc aattgcgaca aatgtaatac tttggttaat gattggttgg tcgtttacat 180 

aaacactgac gtcgtaaaca cccggtatag cggggttacc ttcactgtac cggctgagat 240 

caatattaga tgcattctgc cccatcagga aggtatggtc atattcaacg agttcagcac 300 

tgaaagcagt attgcaatac agcagcgcgc aaaacgtggc gatacgggtc agatgatgat 360 

aatttttagt atattctata gtcacgctat gcttcctgcg gaattatatc cttgcctgat 420 

tacagcctgg cattaccttc aattgcgcca ccaaagtcat taatggcgta aaaatgcact 480 

tttgcagaat tcgctttgcc attaag 506 

<210> 43 
<211> 184 
<212> DNA 

<213> Escherichia coli 



180 
184 



<400> 43 

caggcttgag gaacgtctat atccagacgt tgatcgttaa cgtcataacg aacagaagcc 60 
tgagggataa tttccgtcaa attgaggcaa ttgccgagcg tttcatccct ggcaagcaga 120 
acggcttttt cgttatttat atcgggagaa ttaatatgaa actgcaataa attctttaat 
gtga 

<210> 44 
<211> 327 
<212> DNA 

<213> Escherichia coli 



<220> 

<221> misc_feature 
<222> (1) . . . (327) 
<223> n = A,T,C or G 



<400> 44 

ggctggaggt gatgaccgnc atttccctga tgccagcagc ctgttcggca gataactgca 60 

atgtacgatc aggttctgtg gtttcggtgg tagaaaagcg caaaacatcg ctgctttncg 120 

tcaaccgcca gtatggattg tgcgatccga gaaagttgta caacgttgtg gactccctaa 180 

cggttttcat gatgcatatc ctccggttaa cagcggagtc cgcggttaaa taaaaggaac 240 

aacatgactt acatgaaatt aacggcggct aaacgcattg ntttttttat atatttattt 300 

gtaatccagt tttggaaaaa cgccagt 327 

<210> 45 
<211> 235 
<212> DNA 

<213> Escherichia coli 



<400> 4b 

cgctgggcat caaattcaga catcaaggac atccttatag caaaaggggg aaaatgaata 60 

atgcaaaaat tgccgctaaa agagaagtgt ttaacagcaa cggctaatta tcatccagga 120 

atacgatata taatgacggg atatagcgct aagtatatat attcatctac ttatgcgcgc 180 

ttcagatagc gtttatacca gcgttcgaag gcgac 215 

<210> 46 
<211> 121 
<212> DNA 

<213> Escherichia coli 



<400> 46 

aaataacttt ccgcatagct taaaaaggag ggtaaaatgg caagagatga ataacaaatt 
ccggccagac accaggtata cctctgcgga atatccgtag tagtgggtat tttcataatt 
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<210> 47 
<211> 160 
<212> DNA 

<213> Escherichia coli 



<400> 47 

tgcaaggcag gcgctgaaag aggcgaatag tctacctgga ttatggtgaa ttgctccgcc 60 

agatgttaca gggtttattc actcatcgca tcgcgccaga tcttgtcaac gatacgttgc 120 

gccagcacgg cctgttcgcc ggcggtttgc ggaactgggg 160 



<210> 48 
<211> 195 
<212> DNA 

<213> Escherichia coli 



<400> 48 

tgcgggggaa tttaaactgt gaaaagattc acaaatagct gcggttagga atgtgatatc 60 

cgcagcttat caggtgaagt aataattatt ctcaataatt aactggagag gggaatatta 120 

acgatagaaa ataagataaa caatcatgcc gatagataaa attccgaagc cgatatatgc 180 

caccatataa gcgaa 195 



<210> 49 
<211> 167 
<212> DNA 

<213> Escherichia coli 



<400> 49 

gtgaaaagat tcacaaatag ctgcggttag gaatgtgata tccgcagctt atcaggtgaa 60 

gtaataatta ttctcaataa ttaactggag aggggaatat taacgataga aaataagata 120 

aacaatcatg ccgatagata aaattccgaa gccgatatat gccacca 167 



<210> 50 
<211> 226 
<212> DNA 

<213> Escherichia coli 



<400> 50 

ataatgtact gatattaagg ctaagcgtgc gggggaattt aaactgtgaa aagattcaca 60 

aatagctgcg gttaggaatg tgatatccgc agcttatcag gtgaagtaat aattattctc 120 

aataattaac tggagagggg aatattaacg atagaaaata agataaacaa tcatgccgat 180 

agataaaatt ccgaagccga tatatgccac catataagcg aaaaag 226 



<210> 51 
<211> 185 
<212> DNA 

<213> Escherichia coli 



<400> 51 

ccgaatgacc tgtgcggata gttaattggc atgccgacgc tggagtaggc atccatcatc 60 

tgttctgagg ttatcacttc aatctggtgc gggtaggtat ccagccggta gagtttcgcc 120 

acccggtcta tctctgccag ataaacatcc agcaggtcga acgtccagtc gggtccatcg 180 

ctcaa 185 



<210> 52 
<211> 187 
<212> DNA 

<213> Escherichia coli 



<400> 52 

ttctgattaa cagcggactt aatcttgagc aaagtcaatg ggcgacaatg caaattataa 60 
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tgctattaat tttatactgt tcaagcatgt tatgtctggc tgaagccaat ttagccacag 120 
taaccgttaa tgatggcgaa acttcatcaa tattaattcg taaagcatct atctctgtat 180 
cattaat !8 7 



<210> 53 
<211> 337 
<212> DNA 

<213> Escherichia coli 



<400> 53 

aagagggttt 

attcagcgct 

cagcggactt 

tttatactgt 

tgatggcgaa 

gcttctgtaa 



agttctctcg 
tccgttacac 
aatactgagc 
tcaagcatgt 
acttcatcaa 
atgccagctg 



gtaatgcgct 
cactcaactt 
aaagtacaat 
tatgtctggc 
tattaattcg 
cgctgattgt 



aacctcaaca 
tttgatctct 
ggcgacaatg 
tgaagccaat 
taaagcatct 
ctcacta 



ggctctcggc 
cgtaatcccg 
caaattataa 
ttagccacag 
atctctgtat 



tatcgttata 
ttctgattaa 
tgctattaat 
taaccgttaa 
cattaataat 



60 
120 
180 
240 
300 
337 



<210> 54 
<211> 192 
<212> DNA 

<213> Escherichia coli 



<400> 54 

ctgagcaaag tacaatgggc gacaatgcaa attataatgc . tattaatttt atactgttca 60 

agcatgttat gtctggctga agccaattta gccacagtaa ccgttaatga tggcgaaact 120 

tcatcaatat taattcgtaa agcatctatc tctgtatcat taataatgct tctgtaaatg 180 

ccagctgcgc tg 192 

<210> 55 
<211> 239 
<212> DNA 

<213> Escherichia coli 



<400> 55 

gattaacagc ggacttaata ctgagcaaag tacaatggcg acaatgcaaa ttataatgct 60 

attaatttta tactgttcaa gcatgttatg tctggctgaa gccaatttag ccacagtaac 120 

cgttaatgat ggcgaaactt catcaatatt aattcgtaaa gcatctatct ctgtatcatt 180 

aataatgctt ctgtaaatgc cagctgcgct gattgtctca ctagcaggta tgttattta 239 

<210> 56 
<211> 240 
<212> DNA 

<213> Escherichia coli 



<400> 56 

gattaacagc gggacttaat actgagcaaa gtacaatggc gacaatgcaa attataatgc 60 

tattaatttt atactgttca agcatgttat gtctggctga agccaattta gccacagtaa 120 

ccgttaatga tggcgaaact tcatcaatat taattcgtaa agcatctatc tctgtatcat 180 

taataatgct tctgtaaatg ccagctgcgc tgattgtctc actagcaggt atgttattta 240 

<210> 57 
<211> 330 
<212> DNA 

<213> Escherichia coli 
<400> 57 

cacaacccag tgatgtggct ggtgcgcttc aggatgcgat ctgcacgacc ttttgcacgc 60 
ggcataatgc gcttcatgct cgggccttcg tctacgaaaa ttttcgtaac tttcagatcg 120 
tcaatgtcag cgccatcgtt gtgttcagcg ttagcaatgg cagattccag aactttcttg 180 
accagtacag ccgctttctt gttggtgtag gtcaaaatat ccagagcctg cgacactttc 240 
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ttaccgcgaa tcaggtcagc aacaaggcga accttctgag cagaagaacg agcatggcga 300 
tgtttagcga tagtttccat ctcttcctcc 330 

<210> 58 
<211> 266 
<212> DNA 

<213> Escherichia coli 
<400> 58 

gctttcttgt tggtgtaggt caaaatatcc agagcctgcg acactttctt accgcgaatc 60 
aggtcagcaa caaggcgaac cttctgagca gaagaacgag catggcgatg tttagcgata 120 
gtttccatct cttcctccta ccttatttct tcttcgcttt tttatcagca gcgtggccgc 180 
gataagtacg agtcggtgcg aattcaccca gtttgtgacc aaccatttcg tcggttacaa 240 
ataccggaac gtgctgacga ccatta 266 

<210> 59 
<211> 186 
<212> DNA 

<213> Escherichia coli 
<400> 59 

gcgaatcagg tcagcaacaa ggcgaacctt ctgagcagaa gaacgagcat ggcgatgttt 60 
agcgatagtt tccatctctt cctcctacct tatttcttct tcgctttttt atcagcagcg 120 
tggccgcgat aagtacgagt cggtgcgaat tcacccagtt tgtgaccaac catttcgtcg 180 
gttaca 18 6 

<210> 60 
<211> 138 
<212> DNA 

<213> Escherichia coli 
<400> 60 

gaagaacgag catggcgatg tttagcgata gtttccatct cttcctccta ccttatttct 60 
tcttcgcttt tttatcagca gcgtggccgc gataagtacg agtcggtgcg aattcaccca 120 
gtttgtgacc aaccattt 138 

<210> 61 
<211> 183 
<212> DNA 

<213> Escherichia coli 
<400> 61 

tgtgttcagc gttagcaatg gcagattcca gaactttctt gaccagtaca gccgctttct 60 
tgttggtgta ggtcaaaata tccagagcct gcgacacttt cttaccgcga atcaggtcag 120 
caacaaggcg aaccttctga gcagaagaac gagcatggcg atgtttagcg atagtttcca 180 
tct 183 

<210> 62 
<211> 135 
<212> DNA 

<213> Escherichia coli 
<400> 62 

ggtcaaaata tccagagcct gcgacacttt cttaccgcga atcaggtcag caacaaggcg 60 
aaccttctga gcagaagaac gagcatggcg atgtttagcg atagtttcca tctcttcctc 120 
ctaccttatt tcttc 135 

<210> 63 
<211> 141 
<212> DNA 
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<213> Escherichia coli 



<400> 63 

tgctaaacga atagttacca taacatcctc ttgtgtgaat aaaacaaccg gaccccatcg 60 

aggaacggag tccggtgtca tattaaaagc ccgaaaattt tactcatttt tgcgggaatt 120 

gcaatcaaca gttgctaact c 141 



<210> 64 
<211> 144 
<212> DNA 

<213> Escherichia coli 



<400> 64 

tgctttgctc gcctcatgac gattatgatc gttttgtttt aaaggaggaa gtatcggcag 60 

tagacgaatg agtacgatgt aagcgcaaat ctcaatagcc acaaaaccaa tagaaatcaa 120 

cagttcttcc caggtcggga agta 144 

<210> 65 
<211> 70 
<212> DNA 

<213> Escherichia coli 
<400> 65 

cggcttagcg cgtcgttggc atcgagcatc gctcccatgc cgtttgccag ttgatagcgt 60 
ccatcttgcc 70 

<210> 66 
<211> 175 
<212> DNA 

<213> Escherichia coli 



<400> 66 

accatacaga gaggagatca aagatgtgcc acttgaccgg aacagcgttt acagcatgtt 60 

ggataaaagt atcaagttta ctcattcgct tcagtctgtt gttattgttg tgatttatga 120 

atcatagcac aggtcgtatc gacttctggc cctggcgctt taccccacca gggcc 175 



<210> 67 
<211> 867 
<212> DNA 

<213> Escherichia coli 



<220> 

<221> misc_feature 
<222> (1) . . . (867) 
<223> n « A,T,C or G 



<400> 67 

ccagatttgc ccccgcaaan ccgacccatt gaacgccgac ctgaccgtga atcaagcgtt 60 

catcaatccg gcttaaaaca atattnggca tagcgtgttc ctctttgttt tgcttattgg 120 

ccttcacagg ctgtgtggta ctgcgccagg atgtcctgaa tatggttgat aatgagttcc 180 

cgtggcgttg gctgcagctc gccggagcga actttcacgt actgcagcgg cagatactgg 240 

ctgatcagcg gcagcggaat tggtgaatcc gccagattac gtaccagatg agcgaaagcg 300 

tcatcaatct ggctgtccgg ccagtaatag cgcacgcgat ccgagtagct ataaccacgc 360 

gccagacgac gcgcgttgcc ggtcaccgtg gtagtggctt tgccagtatt cccgggcggt 420 

cgagcatcac gtcttccaag caccctgacg cagaccagaa cangctttcg ctggcaccag 480 

ttctttttna atcgccgcca gagagaacag agcttacgca gggcagaaag gtcancnctg 540 

ggccaacttt cagaatggca aagtgggcaa atcaccaagc ttgncgcagc gattgctgcg 600 

ttttgataat cgggtagaag tgcncttcgg aaaatcagcg tttncgtaag ttttccgacc 660 

catcntggnt taaaggcgct cngtttggcg gggcntgatn aatcaaatta acntttggng 720 

tgatcngaat tccaacnccn gggttgnanc cancaggggn aatgatgccn tgggcnanat 780 
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ngnnatttaa accttggntt tttaaaaggc ntggcctang ggctttcaac ntnggtnccc 840 
gggccngnat ncngcntgnt gacccct 8 67 

<210> 68 
<211> 703 
<212> DNA 

<213> Escherichia coli 



<220> 

<221> misc_feature 
<222> (1) . . . (703) 
<223> n = A,T,C or G 



<400> 68 

ccgccggggc 

tgaacgatca 

cttcngaata 

cagttttgcc 

gctatcagtc 

aggcagagtc 

agcctggcga 

cttatccttc 

gtggccagtt 

tctcagtaag 

gtaccagatc 

tcaattccag 



gggttttttt 
accagctcga 
cgagtgtaac 
acgatctcgt 
ttggcaagag 
gtcttgatga 
tggctgctgt 
tcagtaaaac 
ttccaggcgc 
agatttttta 
accgatatag 
atcgncaaca 



acgttgcttc 
tgtaagccat 
caccggcacg 
tatcacgagt 
taatcagcgg 
tttcatgacg 
tgcggttcag 
cttaacctgt 
atgcccagag 
ccaaggttag 
tggatagctt 
gggcgcaaca 



agattactct 
cggcgcgttg 
gctcgcgaaa 
acgggcgaat 
ctcaactacg 
aaccagtgaa 
ttgacgacca 
gatccggtta 
acagtccacg 
gcgttttaag 
ctgctttaag 
ggatccggat 



gcagcagctt 
tcgcctgcac 
cgcgggccca 
gccagacgac 
cggcgcagct 
cctgccatat 
ctcttacgat 
ctcgtcagcg 
ggaagccagc 
gagctcaacc 
gcagttagca 
cga 



ctgctttctc 
ggaagccaca 
gttcgttaaa 
gattagcaac 
ctttcgcttt 
tgcggaacat 
ggcgcatgac 
atgcttgccg 
acgtctttaa 
tcggtacgct 
gagcggacag 



60 
120 
180 
240 
300 
360 
420 
480 
540 
600 
660 
703 



<210> 69 
<211> 328 
<212> DNA 

<213> Escherichia coli 



<400> 69 

cgttgtcgcc tgcacggaag ccacacttca gaatacgagt gtaaccaccg gcacggctcg 60 

cgaaacgcgg gcccagttcg ttaaacagtt ttgccacgat ctcgttatca cgagtacggg 120 

cgaatgccag acgacgatta gcaacgctat cagtcttggc aagagtaatc agcggctcaa 180 

ctacgcggcg cagctctttc gctttaggca gagtcgtctt gatgatttca tgacgaacca 240 

gtgaacctgc catattgcgg aacatagcct ggcgatggct gctgttgcgg ttcagttgac 300 

gaccactctt acgatggcgc atgacctt 328 

<210> 70 
<211> 414 
<212> DNA 

<213> Escherichia coli 



<400> 70 

cgggtataaa aaaacccgcc ggggcgggtt tttttacgtt gcttcagatt actctgcagc 60 

agcttctgct ttctctgaac gatcaaccag ctcgatgtaa gccatcggcg cgttgtcgcc 120 

tgcacggaag ccacacttca gaatacgagt gtaaccaccg gcacggctcg cgaaacgcgg 180 

gcccagttcg ttaaacagtt ttgccacgat ctcgttatca cgagtacggg cgaatgccag 240 

acgacgatta gcaacgctat cagtcttggc aagagtaatc agcggctcaa ctacgcggcg 300 

cagctctttc gctttaggca gagtcgtctt gatgatttca tgacgaacca gtgaacctgc 360 

catattgcgg aacatagcct ggcgatggct gctgttgcgg ttcagttgac gacc 414 



<210> 71 
<211> 327 
<212> DNA 

<213> Escherichia coli 



<400> 71 
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cgcgaaacgc 
gggcgaatgc 
tcaactacgc 
accagtgaac 
tgacgaccac 
gatccggtta 



gggcccagtt 
cagacgacga 
ggcgcagctc 
ctgccatatt 
tcttacgatg 
ctcgtcagcg 



cgttaaaaca 
ttagcaacgc 
tttcgcttta 
gcggaacata 
gcgcatgacc 
atgcttg 



gttttgccac 
tatcagtctt 
ggcagagtcg 
gcctggcgat 
ttatccttct 



gatctcgtta 
ggcaagagta 
tcttgatgat 
ggctgctgtt 
caagtaaaac 



tcacgagtac 
atcaagcggc 
ttcatgacga 
gcggttcagt 
cttaacctgt 



<210> 72 
<211> 209 
<212> DNA 

<213> Escherichia coli 



60 
120 
180 
240 
300 
327 



<220> 

<221> misc_feature 
<222> (1) . . . (209) 
<223> n « A,T,C or G 



<400> 72 

caactctgtc tntttaagca gagtcgtctt gatgatttca tgacgaacca gtgaacctgc 60 

catattgcgg aacatagcct ggcgatggct gctgttgcgg ttcagttgac gaccactctt 120 

acgatggcgc atgaccttat ccttctcagt aaaaccttaa cctgtgatcc ggttactcgt 180 

cagcgatgct tgccggtggc cagttttcc 209 



<210> 73 
<211> 311 
<212> DNA 

<213> Escherichia coli 



<400> 73 

aacataagga gtgcattatg cgcatagcta aaattggggt catcgccctg ttcctgttta 60 

tggcgttagg cggaattggt ggcgtcatgc tcgcaggtta tacctttatt ttgcgtgctg 120 

gctaagcgcc tgcaccagcc tttcaaacag gcggtctgcg atgatcgccg cagtgccacc 180 

agtaacgccc cctggatcac atacgcggta ttaaatccgc taagcccgat gatgatgggc 240 

gtacccagcg tgctggcccc taccgttgag gcgatcgtcg ccgtaccaat gttgataatc 300 

accgaagttc g 311 

<210> 74 
<211> 167 
<212> DNA 

<213> Escherichia coli 



<400> 74 

cgcgaactcg atctggatac cgataatgtt agcccgcttt cagcgtgttc aggtctacta 60 

cacccgcctt ctactttagc caggtcagac agacgaattt cggctgtaat cgctgcttta 120 

cgagaagtga agccgaattt cggcagacga cggtacagag gcatctg 167 

<210> 75 
<211> 151 
<212> DNA 

<213> Escherichia coli 
<400> 75 

gcagaaaaac acgtttttta ggtttatcag catattgcgc tttcaattgc gcgtactgat 60 
ccagcaggga ttgcgcggct tgttcggcct tgtctggttg cggactccag ggggccagtt 120 
gacgtaacgc attggcaatt tgttcaatgc t 151 

<210> 76 
<211> 151 
<212> DNA 

<213> Escherichia coli 
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<400> 76 



gcgtcaaagt cgctgctgtc gataaagcgc 
cgcgcttccg ccagttccac cgccgcgtgc 
cggatcagct cgagaaaata gcgatccggg 



acgtcgttgg 
agatagcttt 
a 



tcgccacgac gggcaaaccg 
cttcatccgg cctgccggtg 



60 
120 
151 



<210> 77 
<211> 115 
<212> DNA 

<213> Escherichia coli 
<400> 77 

taaccgctat tgatcgcagg ctgaccgcag catccctgtt tctccgggaa atttacgcga 60 
cagccgagtt tttccagtag cagcacggag tctcgtgcca ttcttgattt caggg 115 

<210> 78 
<211> 220 
<212> DNA 

<213> Escherichia coli 
<400> 78 

taaaaatggt tgacataaac ttgccataag ccgtgggagg cacagaaaaa tcaatgatac 60 
ggaatgtgtt ctcactatga ccactgaaat caatattcgg cgtcccggtt tctagcaatt 120 
cgataaattg catatgttta ctgccgatag attttaaaaa atgataaacc tcaagagggt 180 
aatggacatt aacgttatta atgacggtta acgtattaaa 220 

<210> 79 
<211> 258 
<212> DNA 

<213> Escherichia coli 
<400> 79 

ctaaaggatt gagttatcgc attatgccca tgaagccatg tacgcccata tacacaccga 60 
ctaagccaat caacagactg gaaaaatagg gggcgcgttt agcgagagtg ttaaatccgc 120 
tccagcgttt tgcgacctgc tgaacgctga ttgctgcgcc aacgcctacg gtgacaagcg 180 
ttaacgccag gccaatgctg aaactgacga ccagtgttgc gcccagtgtc agggctttca 240 
actgaatgca tatcaaca 258 

<210> 80 
<211> 311 
<212> DNA 

<213> Escherichia coli 
<400> 80 

catatatttt atcaagcttc tggtatgttt ttttatagaa cgcaaaatca gtcttgagta 60 
gtacggaagg gctattgttg taattaataa tatcatctgg ttttaggaca agtaaaaaac 120 
ccgttaagta atttgcatta atgttatacg tatttgttaa atccgatatt tgctggtatt 180 
cacacttcat cttcttattt ttatcaagcc tagatagagg tgcaacatga ttttcaattt 240 
tatatttttt atcatttatt attatatcgt atgtataatc atgtaaaata ccattcgttg 300 
ttattggcat a 311 

<210> 81 
<211> 281 
<212> DNA 

<213> Escherichia coli 
<400> 81 

caaaatcagt cttgagtagt acggaagggc tattgttgta attaataata tcatctggtt 60 
ttaggacaag taaaaaaccc gttaagtaat ttgcattaat gttatacgta tttgttaaat 120 
ccgatatttg ctggtattca cacttcatct tcttattttt atcaagccta gatagaggtg 180 
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caacatgatt ttcaatttta tattttttat catttattat tatatcgtat gtataatcat 240 
gtaaaatacc attcgttgtt attggcataa atgcgaaagc a 281 

<210> 82 
<211> 402 
<212> DNA 

<213> Escherichia coli 
<400> 82 

aacaataaat atataacagc ggttatgagt atggctatta ttattgcgga aactaatgga 60 
atcattgccg ataaaatcat tggaatatgt tttatcttat taaggtggaa aaggagggaa 120 
tttgataaac acataaacag tgacaataaa atcgccccaa aaagattata tagtatattt 180 
aacaaaatat tattcatggc tatattgtct aaggcgtgta ttaacgccac atctgttatt 240 
gttagtacaa tccatgtaat agctaataat gttaaataag ctttaattct atgctgtata 300 
aaaacagatg caataaatac tgcaacagca gatatgaggt accatgcggt ttccagtgaa 360 
aaataaaatt gttttatgta atcaagtttc attgagcatt ca 402 

<210> 83 
<211> 273 
<212> DNA 

<213> Escherichia coli 
<400> 83 

aatgttatac gtatttgtta aatccgatat ttgctggtat tcacacttca tcttcttatt 60 
tttatcaagc ctagatagag gtgcaacatg attttcaatt ttatattttt tatcatttat 120 
tattatatcg tatgtataat catgtaaaat accattcgtt gttattggca taaatgcgaa 180 
agcaaccatc aattgtgcat tgtttgattt aaattttatt tgagagtcag taacacttga 240 
ggctagcata aatttatcat cttgcttgct caa 273 

<210> 84 
<211> 162 
<212> DNA 

<213> Escherichia coli 
<220> 

<221> misc_feature 
<222> (1) . . . (162) 
<223> n = A, T,C or G 

<400> 84 

gnnnnnnnnn nntttnttnt tttggggccg cgtacatang ccagggatct aggagtaaga 60 

atagcttcga attccctccc tggaggcacc acctaaattt tcctgaagtt gagcgaggtt 120 

aatgttggca taacgataaa aaacacccga tgaaaattcc tg 162- 

<210> 85 
<211> 131 
<212> DNA 

<213> Escherichia coli 
<400> 85 

ccaataataa taatgtcgca gtcgtcttcc atgtcatgcc ccagatatcc agaacacaac 60 
accctaacat agcgttactt aagggaaatt gaccgccgac acactcttat cattcgccgc 120 
gatagataca a 131 

<210> 86 
<211> 207 
<212> DNA 

<213> Escherichia coli 
<400> 86 
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gaattttcag acggctaaca atgagacacc agattaaatg atagtcgggg agggagagaa 60 

gaggagggcg caatggctgc gcccgaaaaa taaattagtc gcgcttcgcg ccagtacgca 120 

gcaggccgga tgcgccttca gaatagtcgc gaggcatctg caccggtgcc tgatcgttgc 180 

tggcttcaga ctctgccaga cgattac 207 

<210> 87 
<211> 247 
<212> DNA 

<213> Escherichia coli 



<400> 87 

taacgctaat gcactcaaca gctgggtttg ttttttcatt attcctgctc tcgtacctga 60 

atgataagaa aagagattca aaacagtttg atgttattga attttcagac ggctaacaat 120 

gagacaccag attaaatgat agtcggggag ggagagaaga ggagggcgca atggctgcgc 180 

ccgaaaaata aattagtcgc gcttcgcgcc agtacgcagc aggccggatg cgccttcaga 240 

atagtcg 247 



<210> 88 
<211> 238 
<212> DNA 

<213> Escherichia coli 



<400> 88 

tgaatgataa gaaaagagat tcaaaacagt ttgatgttat tgaattttca gacggctaac 60 

aatgagacac cagattaaat gatagtcggg gagggagaga agaggagggc gcaatggctg 120 

cgcccgaaaa ataaattagt cgcgcttcgc gccagtacgc agcaggccgg atgcgccttc 180 

agaatagtcg cgaggcatct gcaccggtgc ctgatcgttg ctggcttcag actctgcc 238 



<210> 89 
<211> 582 
<212> DNA 

<213> Escherichia coli 



<220> 

<221> misc_f eature 
<222> (1) . . . (582) 
<223> n = A,T,C or G 



<400> 89 

ttcaaaacag tttgatgtta ttgaattttc agacggctaa caatgagaca ccagattaaa 60 

tgatagtcgg ggagggagag aagaggaggg cgcaatggct gcgcccgaaa aataaattag 120 

tcgcgcttcg cgccagtacg cagcaggccg gatgcgcctt cagaatagtc gcgaggcatc 180 

tgcaccggtg cctgatcgtt gctggcttca gactctgcca gacgattacg gaacgggttt 240 

gcttcagcag acagttccgg cagcaggctg ctggagcttt ttgccttcta tcaaagcctg 300 

ccgggagagc ggaaacagcg cctgcacttt caccgtttta tgctgcgggt gcatgaagag 360 

ctaactgcct tacaggggca gaccgatccg ctggaaatta ttgccgatcg ctttaaagcc 420 

cgaaactgac gtgctctgtt ttgacgaatt ttttgtttct gatattaccc gatgccatgc 480 

tacttggcgg tctgatgaaa gccctgttcg ctcgcggtat taccctggta gcgacgtcaa 540 

atattncgcc ggatgaactt tatcgaaatg gcctgcaacg tg 582 



<210> 90 
<211> 287 
<212> DNA 

<213> Escherichia coli 



<220> 

<221> misc_f eature 
<222> (1) . . . (287) 
<223> n = A, T, C or G 
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<400> 90 

tgcactcaac agctgggttt gntttttcat tattcctgct ctcgtacctg aatgataaga 60 

aaagagattc aaaacagttt gatgttattg aattttcaga cggctaacaa tgagacacca 120 

gattaaatga tagtcgggga gggagagaag aggagggcgc aatggctgcg cccgaaaaat 180 

aaattaagtc gcgcttcgcg ccagtacgca gcaggccgga tgcgccttca gaatagtcgc 240 

gaggcatctg caccggtgcc tgatcgttgc tggcttcaga ctctgcc 287 



<210> 91 
<211> 200 
<212> DNA 

<213> Escherichia coli 
<400> 91 

taccagatca ccgatatagt ggatagcttc tgctttaagg cagttagcag agcggacagt 60 
caattccaga tcgtcaacag ggcgcagcag gatcggatcg aactctggtt tctcttcttt 120 
cacttcaggc tgacgtacat cacgtaagtc aacgaaagct tccagttgtt cagccagaat 180 
ggttgccgca cgacgaatcg 200 



<210> 92 
<211> 278 
<212> DNA 

<213> Escherichia coli 
<400> 92 

ccacgatctc gttatcacga gtacgggcga atgccagacg acgattagca acgctatcag • 60 

tcttggcaag agtaatcagc ggctcaacta cgcggcgcag ctctttcgct ttaggcagag 120 
tcgtcttgat gatttcatga cgaaccagtg aacctgccat attgcggaac atagcctggc 180 
gatggctgct gttgcggttc agttgacgac cactcttacg atggcgcatg accttatcct 240 
tctcagtaaa accttaacct gtgatccggt tactcgtc 278 



<210> 93 
<211> 424 
<212> DNA 

<213> Escherichia coli 



<400> 93 

cttagccatt tagtaaccct accttacttg cggaacggga agtcaaaggc agccagcaga 60 

gcgcggcctt cttcgtcaga tttcgcagta gtggtaatgg taatatccaa accacgaacg 120 

cggtcgactt tatcgtagtc gatttctggg aagatgatct gctcacggac acccatgctg 180 

tagttaccac gaccgtcgaa agacttagcg gacaggccac ggaagtcacg gatacgaggt 240 

acagcaatag tgatcaggcg ctcaaagaac tcccacatgc gttcgccacg cagagttact 300 

ttacagccga tcggatagcc ctgacggatt ttgaagcctg caacagattt gcgtgctttg 360 

gtgatcagcg gtttttgacc ggagattgct gccaggtctg ctgctgcgtt atccagcagt 420 

tttt ~ 424 



<210> 94 
<211> 161 
<212> DNA 

<213> Escherichia coli 



<400> 94 

gactgggagt taacaagcgt aactccctgg gaaatgcgag cttaaccact caggggttag 60 

ctttattact tagaaacggt cagacgagcg cggcctttag cacgacgacg tgccagaacc 120 

tgacgaccat ttttagtagc catacgagca cggaagccgt g 161 

<210> 95 
<211> 187 
<212> DNA 

<213> Escherichia coli 
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<400> 95 

tggctgctgg aagacgaatg tgaattgact gggagttaac aagcgtaact ccctgggaaa 60 

tgcgagctta accactcagg ggttagcttt attacttaga aacggtcaga cgagcgcggc 120 

ctttagcacg acgacgtgcc agaacctgac gaccattttt agtagccata cgagcacgga 180 

agccgtg 187 

<210> 96 
<211> 236 
<212> DNA 

<213> Escherichia coli 
<400> 96 

tgccagcccg ttgtggctgc tggaagacga atgtgaattg actgggagtt aacaagcgta 60 
actccctggg aaatgcgagc ttaaccactc aggggttagc tttattactt agaaacggtc 120 
agacgagcgc ggcctttagc acgacgacgt gccagaacct gacgaccatt tttagtagcc 180 
atacgagcac ggaagccgtg agaacggttg cgcttcagta cagacggttg aaaagt 236 

<210> 97 
<211> 828 
<212> DNA 

<213> Escherichia coli 
<220> 

<221> misc_feature 
<222> (1) . . . (828) 
<223> n = A, T, C or G 



<400> 97 

gcgaattcag gcggccgaga atggtaattt gcggcgtgcc agcccgttgt ggctgctgga 60 

agacgaatgt gaattgactg ggagttaaca agcgtaactc cctgggaaat gccagcttaa 120 

ccactcaggg gttagcttta ttacttatta aacggtcaga cgagcgcggg ctttatcacg 180 

accacgtgcc ngaacctgac gaccattttt agnagccata ctagcacgga agccgtgaga 240 

acggnngcgc ttcagtacan acggntgaaa agngcctttc atggcnattt ctggggatnn 300 

cgtcnacctg cngnnctaac ataangggnn ggggccncta aacgggtcct tgaggggttt 360 

tttgaagctt gggcttttca ncctgatnca gattaaatca naacgccnga ancggtctga 420 

taaaacannc nnnncntngn cttntcnnna tttgcnacnn ntccnccacn nnngtgcatn 4 80 

ccnntnncan agnactncga ntnnccnncn tnnngcnttt nngnngcnnt atnnangctg 540 

tctcgncccn nnnccncttn nnnnnttnnc cgcnnnatcc ntnnactnna cccnnntnca 600 

cgnnannctn cnnccgcncn nnnnctcnat ntnttnnngn tctcnnntgn cnnnnnnntn 660 

tctnatgncg tnncncnnnc nnnctcctnn nncatgnctt tctncnactn nntatcnncn 720 

gtcnnttncn ncnnnnaann ncntctcntc nnnnnngnnt nccatcgntc anncgnncgn 7 80 

ncnntnacnt cctantnnnn nnnnntcnnn ntatcnnnat ctggcccg 828 



<210> 98 
<211> 183 
<212> DNA 

<213> Escherichia coli 



<400> 98 

gctgctcgtc ggtcgcgtcg aggttaacga acagatcgaa tacctgatcc acaacccaat 60 

caggacgcgc gccagggcgg tcaactttgt tgataacaac aatcggcttc aggccgtaag 120 

caaacgcttt tttggttacg aagcgcgttt gcggcatcgg gccgtcaaat gcgtcaacca 180 

cca 183 



<210> 99 
<211> 192 
<212> DNA 

<213> Escherichia coli 
<400> 99 
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ggcatcatgc tgtgttgcgc ggcaacgcga aagacgtctg gcgtcaggca tggcagggtg 60 

agtatgggcg tagcgctacg ttgccatact ggtatttgcg agagatcttt attcagaaac 120 

cggaggtaat ctttgatgat gtcgtgacta atatgagcaa ccaaagtgtt gttaagagag 180 

gagacgtcga ta 192 



<210> 100 
<211> 242 
<212> DNA 

<213> Escherichia coli 



<400> 100 

taattttgtt ggaaagtaat catatataaa ttttggaacc aaccctgcaa tacctaaatt 60 

tgtaaacatt aatccaaaga gacataatcc tataagagaa gagtttttca cagaaataaa 120 

aaaaagagga caaaggaaaa taaatgaagt tattagaccg actacaaagg ctttttttac 180 

accaatctta tcaccaacaa aaccaaaaaa tattgtacct gtcagtgttc ctaaacctgc 240 

ta 242 



<210> 101 
<211> 177 
<212> DNA 

<213> Escherichia coli 



<400> 101 

gacgcggcga acgatttcgg cttgttcttt tactggaggt aataaaacaa cttgggattt 60 

gatatctttt cctgaaatac ctttttgacc agaagttgtt ttcacgcagt tcatcattgc 120 

atttcgtgct gagggggatg aaaaaaatat ttcgatatat tctggtaaag catcttt 177 



<210> 102 
<211> 250 
<212> DNA 

<213> Escherichia coli 



<400> 102 

gacctgtttt tctatggtgt cggcgtaggc gaagagttgc tcgacgcggc gaacgatttc 60 

ggcttgttct tttactggag gtaataaaac aacttgggat ttgatatctt ttcctgaaat 120 

acctttttga ccagaagttg ttttcacgca gttcatcatt gcatttcgtg ctgaggggga 180 

tgaaaaaaat atttcgatat attctggtaa agcatctttg gttaatcgag ctcgaataag 240 

tttatcagga 250 



<210> 103 
<211> 194 
<212> DNA 

<213> Escherichia coli 



<400> 103 

ttaggtcgaa cacctttcgg cgcgaacaag ccctggtatt cattggcgat gctgtgtcga 60 

atcacctgct caggtgaaaa accctgctgg cggaattttt ccagctcccg tatcgccccg 120 

ttagcgccct gctccgttcg aatcggtttt cgcaatgcct ggcgaaattc aacccactca 180 

cgccaaagcg agac 194 

<210> 104 
<211> 173 
<212> DNA 

<213> Escherichia coli 



<400> 104 

tccataaatc cctcttaatt tcacgtatcg aataacgttg ttttaatcta gctatgagga 60 

acaaataaaa taaaaatatt atttttaata ctgaagcaac atctttggct gctttctttt 120 

ttaatattgc tggccgattc attttctgat atcggccatt tttattgcgg gta 173 
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<210> 105 
<211> 173 
<212> DNA 

<213> Escherichia coli 
<400> 105 

tccataaatc cctcttaatt tcacgtatcg aataacgttg ttttaatcta gctatgagga 60 
acaaataaaa taaaaatatt atttttaata ctgaagcaac atctttggct gctttctttt 120 
ttaatattgc tggccgattc attttctgat atcggccatt tttattgcgg gta 173 

<210> 106 
<211> 142 
<212> DNA 

<213> Escherichia coli 
<400> 106 

aacagctcca gcgccaacag cgctccaacg cccgccgagg gcagggtgga gagaatcgtc 60 
agcggatgta cgtaactctc gtaaaggata cccagcacga tatacaccgt ggcgatggcg 120 
gcgataatca ggatcacctg eg 142 

<210> 107 
<211> 263 
<212> DNA 

<213> Escherichia coli 
<400> 107 

aaaatcttat tcagcgcctt catttggttt gggcctttaa tcccttagag ccatcccgtt 60 
gagcaagggt ggcgtaaccc gatcatcaca ataatatcga ccacaatagt ggtgacgccc 120 
gagcacgata tactgeatea gttgcggctg ttgcggcatg atgaattgcg gaaatagege 180 
cgccagaaac acaatacttt tgggattggt gagattcaca aaaactgege gctggaacaa 240 
atgtcgacgc gattgagtag ggg 263 

<210> 108 
<211> 235 
<212> DNA 

<213> Escherichia coli 
<400> 108 

teggaagect gaatgeggat aegtagegea gettgeageg gttcgtccag cttattgacc 60 
cacaaatget gatcgcccag cgccagcgcg gtcatcgcgt aatgcggatg atgetcaage 120 
accgtcactt teagaatget actttgttgc tettteggea gecaeggatt catcacgcta 180 
ctgccccaca cttcctccag cgcgccaaag gctttcacct gaeegtttte cagta 235 

<210> 109 
<211> 147 
<212> DNA 

<213> Escherichia coli 
<400> 109 

gecaggatga cactttttgt gtaatggccg gagttcacat ttttaattta gatcaaaggg 60 
gggaagaata agcagaaaaa gcccgccata acagegggea ggagggattt agaactgata 120 
aaccagacct aaagcgacaa tatcatc 147 

<210> 110 
<211> 241 
<212> DNA 

<213> Escherichia coli 
<400> 110 

tcataaaacc tcctgtttta gtatcegcat aaagtgtaac gecagatgae actttttgtg 60 
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taatgacgga gttcacattt ttaatttaga tcaaaggagg aagaataagc agaaaaagcc 120 

cgccataaca gcgggcagga ggatttagaa ctgataaacc agacctaaag cgacaatatc 180 

atcggtagag atgccattgg cagcgtagaa gctgtcatct tcatccaaca ggttgatttt 240 

a 241 



<210> 111 
<211> 125 
<212> DNA 

<213> Escherichia coli 



<400> 111 

tccgcataaa gtgtaacgcc agatgacact ttttgtgtaa tgacggagtt cacattttta 60 

atttagatca aaggaggaag aataagcaga aaaagcccgc cataacagcg ggcaggagga 120 

tttag 125 



<210> 112 
<211> 218 
<212> DNA 

<213> Escherichia coli 
<220> 

<221> misc_feature 
<222> (1) . . . (218) 
<223> n - A,T,C or G 



<400> 112 

taatgaccga gttcacattt ttaatttana tcaaangagg aaagaataag cngaaaaagc 60 

ccgccataac agcgggcagg aggatttaga actgataaac cagacctaaa gcgacaatat 120 

catcggtaga gatgccattg gcagcgtana agctgtcatc ttcatccaac aggttgattt 180 

tatagtcaac gtaggtggac atgtttttat tgaaatag 218 

<210> 113 
<211> 148 
<212> DNA 

<213> Escherichia coli 



<400> 113 

cgcataaagt gtaacgccag atgacacttt ttgtgtaatg acggagttca catttttaat 60 

ttagatcaaa ggaggaagaa taagcagaaa aagcccgcca taacagcggg caggaggatt 120 

tagaactgat aaaccagacc taaagcga 148 

<210> 114 
<211> 193 
<212> DNA 

<213> Escherichia coli 



<400> 114 

tgcggtattg cgcaggccaa agaaagcaag cgaaggtgtc aggtcgctca tttcgacttc 60 

gggccagcgt ggcacaaaga ccaatggtga actgccatga caggtattgg ccccagcagc 120 

ggaactcaca aggcaccata acgtcccctc cctgataacg ctgatactgt ggtcgcggtt 180 

atgccagttg gca 193 



<210> 115 
<211> 117 
<212> DNA 

<213> Escherichia coli 



<400> 115 

tgccatgaca ggtattggcc ccagcagcgg aactcacaag gcaccataac gtcccctccc 60 
tgataacgct gatactgtgg tcgcggttat gccagttggc atcttcacgt aaataga 117. 
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<210> 116 
<211> 133 
<212> DNA 

<213> Escherichia coli 



<400> 116 

taacctgccc gcgcaaacgt gttattcgat aaggctttct gaaggggtga tcaagttgcg 60 

ggttcaggca gctttcataa taggcctgaa tctccatcat gcgagtgcga tgacgctggt 120 

aacgacgcca ggc 133 



<210> 117 
<211> 288 
<212> DNA 

<213> Escherichia coli 



<400> 117 

taatctagct atgaggaaca aataaaataa aaatattatt tttaatactg aagcaacatc 60 

tttggctgct ttctttttta atattgctgg ccgattcatt ttctgatatc ggccattttt 120 

attgcgggta tcaggccagt ttatggttag ccattaattc cagcgcctgc actaacgcag 180 

agtgatccaa ctggctgcca ccgtttgccg cacaggtatt aaataactcc tggcaggtcg 240 

cagtgtttgg caggttcagc gcaagtgctt tcgcactttg cagtgcca 288 

<210> 118 
<211> 246 
<212> DNA 

<213> Escherichia coli 



<400> 118 

taatttcacg tatcgaataa cgttgtttta atctagctat gaggaacaaa taaaataaaa 60 

atattatttt taatactgaa gcaacatctt tggctgcttt cttttttaat attgctggcc 120 

gattcatttt ctgatatcgg ccatttttat tgcgggtatc aggccagttt atggttagcc 180 

attaattcca gcgcctgcac taacgcagag tgatccaact ggctgccacc gtttgccgca 240 

caggta 24 6 

<210> 119 
<211> 525 
<212> DNA 

<213> Escherichia coli 



<400> 119 

taatttcacg 

atattatttt 

gattcatttt 

attaattcca 

caggtattaa 

gcactttgca 

aaggtgcgtt 

cccatcagcg 

gcttcagaaa 



tatcgaataa 
taatactgaa 
ctgatatcgg 
gcgcctgcac 
ataactcctg 
gtgccaggtt 
taatcatacg 
cctggcgcac 
ccgcttcaat 



cgttgtttta 
gcaacatctt 
ccatttttat 
taacgcagag 
gcaggtcgca 
gagatctttc 
ctcgccatga 
acgtaccggg 
attgagcgcc 



atctagctat 
tggctgcttt 
tgcgggtatc 
tgatccaact 
gtgtttggca 
tggtgcagag 
acttccagaa 
tccgcaccgg 
acgataatct 



gaggaacaaa 
cttttttaat 
aggccagttt 
ggctgccacc 
ggttcagcgc 
cgattttgaa 
tacgtgagga 
cttttgaagc 
gattt 



taaaataaaa 
attgctggcc 
atggttagcc 
gtttgccgca 
aagtgctttc 
gcccggatta 
agcaaagccg 
aaatagcagg 



60 
120 
180 
240 
300 
360 
420 
480 
525 



<210> 120 
<211> 161 
<212> DNA 

<213> Escherichia coli 



<220> 

<221> misc_feature 
<222> (1) . . . (161) 
<223> n = A, T, C or G 
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<400> 120 

tcttaatttc acgtatcgaa taacgntgnt ttaatctagc tatgaggnac aaataaaata 60 

aaaatattat ttttaatact gaagcaacat ctttggctgc tttctttttt aatattgctg 120 

gccgattcat tttctgatat cggccatttt tattgcgggt a 161 



<210> 121 
<211> 535 
<212> DNA 

<213> Escherichia coli 



<220> 

<221> raisc_feature 
<222> (1) . . . (535) 
<223> n = A,T,C or G 



<400> 121 

cagtatcaat cagcgactgg cgctgtgctt ctgctgcatc tactgcgacg ctatgctgtg 60 

cctcggtatc cgtcacccat ttctcaccat cccatttatc gtatggcgtt aatggggcga 120 

tagtcgcggg acgcatacaa acataacccc tggatgttat gtgtctatcg agaatcaaag 180 

tggaagtcct tatgttaagt ataagacagg aatcacttat aacgataatc agttagtata 240 

tgtatctatc actgttgatg ataatatcag cacttggttc tggagggggt ttgttgtggg 300 

caatgatgca tttaagttat cgnctgcaga tagaggagat attacaataa acaacgaatc 360 

agggcatttg atagtcaata ccgcaattct atcaggagat atagtcactc taagaggagg 420 

agaaattagg ttggtattat agcttgtgcg cgccatgatt ggcgcgcaat ttaaacttag 480 

tgctttacat cgctattgnc ttgatttctt tgaattattc tataaattaa aaaaa 535 

<210> 122 
<211> 267 
<212> DNA 

<213> Escherichia coli 



<400> 122 

ccattttttc aatgaaaatt aaaaagggga tttttgagaa gggtagaatg tcgtttgttt 60 

ttttctttat tatgtctggt gacgcggggg gatagaacgt ctgtgcatat agtagtgaat 120 

ttattcaggt ctacattatt ctctattttg gtcagtttta tgagttcatt tacagccata 180 

ctcattaaat catctgttat cgaaaattta tatatgctca tagcagataa tattgcttgc 240 

tgaattatct cttcagaaat atccgct 267 



<210> 123 
<211> 203 
<212> DNA 

<213> Escherichia coli 
<400> 123 

ttttttcttt attatgtctg gtggacgccg gggggataga accgtctgtg catatagtag 60 
tgaatttatt caggtctaca ttattctcta ttttggcagt tttatgagtt catttacagc 120 
catactcatt aaatcatctg ttatcgaaaa tttatatatg ctcatagcag ataatattgc 180 
ttgctgaatt atctcttcag aaa 203 

<210> 124 
<211> 247 
<212> DNA 

<213> Escherichia coli 



<400> 124 

ccttaatgtg tatgacctgg attcatagta tcaattgcca ttttttcaat gaaaattaaa 60 

aaggggattt ttgagaaggg tagaatgtcg tttgtttttt tctttattat gtctggtgac 120 

gcggggggat agaacgtctg tgcatatagt agtgaattta ttcaggtcta cattattctc 180 

tattttggtc agttttatga gttcatttac agccatactc attaaatcat ctgttatcga 240 
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aaattta 247 

<210> 125 
<211> 145 
<212> DNA 

<213> Escherichia coii 
<400> 125 

aaattaaaaa ggggattttt gagaagggta gaatgtcgtt tgtttttttc tttattatgt 60 
ctggtgacgc ggggggatag aacgtctgtg catatagtag tgaatttatt caggtctaca 120 
ttattctcta ttttggtcag tttta 145 

<210> 126 
<211> 469 
<212> DNA 

<213> Escherichia coli 



<400> 126 

aatgagaatg attggccttc tttatgattt taaggattat gcttctaaaa tggcggagaa 60 

catggcgagg cttgctgcct tacttcatta cttcagcggt gatggaggcg atatatctgt 120 

taccgggtaa tgactccaac ttattgatag tgttttatgt tcagataatg cccgatgact 180 

ttgtcatgct gctccaccga ttttgagaac gacagcgact tccgtcccag ccgtgccagg 240 

tgctgcctca gattcaggtt atgccgctca attcgctgcg tatatcgctt gctgattacg 300 

tgcagctttc ccttcaggcg ggattcatac agcggccagc catccgtcat ccatatcacc 360 

acgtcaaagg gtgacagcag gctcataaga cgccccagcg tcgccatagt gcgttcaccg 4 20 

aatacgtgcg caacaaccgt cttccggagc ctgtcatacg cgtaaaaca 4 69 



<210> 127 
<211> 596 
<212> DNA 

<213> Escherichia coli 
<220> 

<221> mis cofeature 
<222> (1) . . . (596) 
<223> n = A,T,C or G 



<400> 127 

cgatcagtta tccctgtgtc tgcaacattt ccacgagaag ctatcattgt tatatctaat 60 

gctagtcccg gattattcaa tattagagag tcaaatattc tttgtgatgc tgtatcgttc 120 

caccattttg tcaatgcttt tgcagcatta tcatctgctt gctgctgtgc agggcttttg 180 

ggccatggaa ttggcaaggg gaatggtcta ggaaaaactt ctaaccccag aggatctgta 240 

ttcgtaactg gattcaacgg atactgataa aaattccatc cccccttcag cccaatcgga 300 

tcctgagtga tataccgccc ctgcagcggg tcataatagc ggtggcggtt gtaatacagg 360 

ccggactcct catcatactg ctgccccggc aggcggataa gctgctgcag ctgatgcggg 420 

ttctcttcat tcagcaggtt gccccattca tcgtattctg cgcaccattc tgttgccctt 480 

ccttgctgat aagggccagc ggcaggccgc gatggtcgca gtggtacang tggatttttc 540 

gcgccggcgt gtacaccggg tccatctggt tttgcatctg ctccacggnc angccg 596 



<210> 128 
<211> 963 
<212> DNA 

<213> Escherichia coli 

<220> 

<221> CDS 

<222> (1) . . . (963) 

<400> 128 

gtg atg ate aaa acg cgt ttt tct cgc tgg eta acg ttt ttt acg ttc 48 
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Met Met He Lys Thr Arg Phe Ser Arg Trp Leu Thr Phe Phe Thr Phe 

1 5 10 15 

gcc get gec gtg gcg ctg gcg eta ccg gca aaa gee aac acc tgg ccg 

Ala Ala Ala Val Ala Leu Ala Leu Pro Ala Lys Ala Asn Thr Trp Pro 

20 25 30 



gtc ggc ttt etc get ctg tta cag get aac ccc ggc gtt gat cct tac 
Val Gly Phe Leu Ala Leu Leu Gin Ala Asn Pro Gly Val Asp Pro Tyr 

70 75 80 



65 



96 



ctg ccg cca gcg ggc agt cgt ctg gtt ggc gaa aac aaa ttt cat gtg 144 
Leu Pro Pro Ala Gly Ser Arg Leu Val Gly Glu Asn Lys Phe His Val 
35 40 45 

gtg gaa aat gac ggt ggt tct ctg gaa gcc ate gcc aaa aaa tac aac 192 
Val Glu Asn Asp Gly Gly Ser Leu Glu Ala He Ala Lys Lys Tyr Asn 
50 1 ^ 55 60 



240 



288 



336 



384 



gta ccg cgc gcg ggc age gtg tta acg ate ccg ttg caa acc eta ctt 
Val Pro Arg Ala Gly Ser Val Leu Thr He Pro Leu Gin Thr Leu Leu 
85 90 95 

cca gat gcg ccg cgc gaa ggc att gtg ate aac att gcg gag ctg cgt 
Pro Asp Ala Pro Arg Glu Gly He Val He Asn He Ala Glu Leu Arg 
100 105 110 

etc tat tac tac ccg ccg ggt aaa aat teg gta acc gtg tat cca ata 
Leu Tyr Tyr Tyr Pro Pro Gly Lys Asn Ser Val Thr Val Tyr Pro lie 
115 120 125 

ggt att ggt cag tta ggt ggt gac acg ctg aca ccg aca atg gtg acc 432 
Gly He Gly Gin Leu Gly Gly Asp Thr Leu Thr Pro Thr Met Val Thr 
130 135 140 

acc gtt tea gac aaa cgt gca aac cca acc tgg acg cca acg gca aac 
Thr Val Ser Asp Lys Arg Ala Asn Pro Thr Trp Thr Pro Thr Ala Asn 
145 150 155 160 

ate cgc gcc cgt tat aaa gca cag gga att gag ttg cct gcg gta gtg 
He Arg Ala Arg Tyr Lys Ala Gin Gly He Glu Leu Pro Ala Val Val 
165 170 175 



480 



528 



ccg get gga ctg gat aac cca atg ggc cat cat gcg att cgt ctg gcg 576 
Pro Ala Gly Leu Asp Asn Pro Met Gly His His Ala He Arg Leu Ala 
180 185 190 

gcc tat ggc ggc gtt tat ttg ctt cat ggt acg aac gcc gat ttc ggc 624 
Ala Tyr Gly Gly Val Tyr Leu Leu His Gly Thr Asn Ala Asp Phe Gly 
195 200 205 

att ggc atg egg gta agt tct ggc tgt att cgt ctg egg gat gac gat 
He Gly Met Arg Val Ser Ser Gly Cys He Arg Leu Arg Asp Asp Asp 
210 215 220 

ate aaa aca etc ttt age cag gtc acc cca ggc acc aaa gtg aat ate 
He Lys Thr Leu Phe Ser Gin Val Thr Pro Gly Thr Lys Val Asn He 
225 230 235 240 

ate aac act ccg ata aaa gtc tct gcc gaa cca aac ggt gcg cgt ctg 



672 



720 



7 68 
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lie Asn Thr Pro He Lys Val Ser Ala Glu Pro Asn Gly Ala Arg Leu 
245 " 250 255 

gtt gaa gta cat cag ccg ctg tea gag aag att gat gac gat ccg cag 816 

Val Glu Val His Gin Pro Leu Ser Glu Lys He Asp Asp Asp Pro Gin 

260 265 270 

ctg ctg cca att acg ctg aat age gca atg caa tea ttt aaa gat gca 864 

Leu Leu Pro He Thr Leu Asn Ser Ala Met Gin Ser Phe Lys Asp Ala 

275 * 280 285 

gca caa act gac get gaa gtg atg caa cat gtg atg gat gtc cgt tec 912 

Ala Gin Thr Asp Ala Glu Val Met Gin His Val Met Asp Val Arg Ser 

290 295 300 

ggg atg ccg gtg gat gtc cgc cgt cat caa gtg age cca caa acg ctg 960 

Gly Met Pro Val Asp Val Arg Arg His Gin Val Ser Pro Gin Thr Leu 

305 310 315 320 

taa 963 



<210> 129 
<211> 2187 
<212> DNA 

<213> Escherichia coli 

<220> 

<221> CDS 

<222> (1) . . . (2187) 

<400> 129 

atg att act cgt att cct cgt agt tct ttc tct gca aat att aat aat 48 

Met He Thr Arg He Pro Arg Ser Ser Phe Ser Ala Asn He Asn Asn 

1 5 10 15 

aca gee cag aca aat gaa cac caa acc ctg agt gaa ttg ttt tat aaa 96 
Thr Ala Gin Thr Asn Glu His Gin Thr Leu Ser Glu Leu Phe Tyr Lys 
20 25 30 

gaa etc gag gat aaa ttt tct ggc aag gag ctg gcg acg cct eta tta 14 4 

Glu Leu Glu Asp Lys Phe Ser Gly Lys Glu Leu Ala Thr Pro Leu Leu 
35 40 45 

aaa age ttc tea gag aac tgt aga caa aat ggt cgt cat ate ttt age 192 
Lys Ser Phe Ser Glu Asn Cys Arg Gin Asn Gly Arg His He Phe Ser 
50 55 60 

aac aag gat ttt gtc att aaa ttt tec acg tec gtc tta caa get gat 240 
Asn Lys Asp Phe Val He Lys Phe Ser Thr Ser Val Leu Gin Ala Asp 
65 70 75 80 

aag aaa gaa att acg ata att aat aaa aac gaa aac acg aca ctt act 288 
Lys Lys Glu He Thr He He Asn Lys Asn Glu Asn Thr Thr Leu Thr 
85 90 95 

caa acc att gee cca ata ttt gaa aaa tac eta atg gaa att tta cct 336 
Gin Thr He Ala Pro He Phe Glu Lys Tyr Leu Met Glu He Leu Pro 
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100 105 HO 

caa cgc tea gac act ctt gat aaa caa gaa tta aac eta aaa tea gat 
Gin Arg Ser Asp Thr Leu Asp Lys Gin Glu Leu Asn Leu Lys Ser Asp 
115 120 125 

aga aaa gaa aaa gaa ttc cca aga att aaa ctt aat ggt caa tgt tat 
Arg Lys Glu Lys Glu Phe Pro Arg He Lys Leu Asn Gly Gin Cys Tyr 
130 135 140 

ttt ccg ggg cga ccc caa aac cgt ata gta tgc cga cac att get gca 
Phe Pro Gly Arg Pro Gin Asn Arg He Val Cys Arg His He Ala. Ala 
145 ' 150- 155 160 

caa tat att aat gat att tat cag aat gtt gat tac aaa ccc cat caa 
Gin Tyr He Asn Asp He Tyr Gin Asn Val Asp Tyr Lys Pro His Gin 
165 170 175 

gat gat tac tct tea get gaa aaa ttt etc acg cac ttc aac aaa aaa 
Asp Asp Tyr Ser Ser Ala Glu Lys Phe Leu Thr His Phe Asn Lys Lys 
180 185 190 

tgc aaa aac cag act ttg gcg ttg gtt tec age cgt cct gag ggg cgt 
Cys Lys Asn Gin Thr Leu Ala Leu Val Ser Ser Arg Pro Glu Gly Arg 
195 200 205 

tgc gtt get gee tgc ggt gat ttc ggg eta gtt atg aaa gca tat ttt 
Cys Val Ala Ala Cys Gly Asp Phe Gly Leu Val Met Lys Ala Tyr Phe 
210 215 220 

gac aag atg gaa tea aat ggc ate agt gtt atg gca gec ata tta ctg 
Asp Lys Met Glu Ser Asn Gly He Ser Val Met Ala Ala He Leu Leu 
225 230 235 240 



tat tct ctg atg gat ttt atg aat gta gat tat age etc ctg aaa tgg 
Tyr Ser Leu Met Asp Phe Met Asn Val Asp Tyr Ser Leu Leu Lys Trp 
290 295 300 

tea aat gat cat gtt att aat caa tct gtt gca ata att cca gca ctt 
Ser Asn Asp His Val He Asn Gin Ser Val Ala He He Pro Ala Leu 
305 310 315 320 

ccg aaa gaa cag eta ttg atg tta aaa gga tct gtg gat gaa ata acc 
Pro Lys Glu Gin Leu Leu Met Leu Lys Gly Ser Val Asp Glu He Thr 
325 330 335 

cct cca tta tea cct gca acg atg aat ttg eta atg gca att ggt cag 
Pro Pro Leu Ser Pro Ala Thr Met Asn Leu Leu Met Ala He Gly Gin 



384 



432 



480 



528 



576 



624 



672 



720 



768 



816 



gtg gat aac cat get ttg acg gtc egg eta aga ata aag aac aca act 
Val Asp Asn His Ala Leu Thr Val Arg Leu Arg He Lys Asn Thr Thr 
245 250 255 

gaa gga tgt acc cat tac gtg gtt teg gtt tat gat cct aat gta act 
Glu Gly Cys Thr His Tyr Val Val Ser Val Tyr Asp Pro Asn Val. Thr 
260 265 270 

aac gat aaa ata aga att atg age gaa age aaa gag aat att aaa cac 864 
Asn Asp Lys He Arg He Met Ser Glu Ser Lys Glu Asn He Lys His 
275 280 285 



912 



960 



1008 



1056 
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340 345 350 

aat cac caa ctt acg caa ctg atg att caa etc cag aaa atg cca gaa 1104 
Asn His Gin Leu Thr Gin Leu Met He Gin Leu Gin Lys Met Pro Glu 
355 360 365 

eta cat aga aca gaa atg ttg act gee tat aat agt att aac ctg cca 1152 
Leu His Arg Thr Glu Met Leu Thr Ala Tyr Asn Ser He Asn Leu Pro 
370 375 380 



ggt tta tat ttg get ata aat tat ggt aat gcg gat ate gtt gag act 
Gly Leu Tyr Leu Ala He Asn Tyr Gly Asn Ala Asp He Val Glu Thr 
385 390 395 400 



gga tta ttt tta gcg ata tea cgt aag gat aaa aat gtt gta ace teg 
Gly Leu Phe Leu Ala He Ser Arg Lys Asp Lys Asn Val Val Thr Ser 
435 440 445 

att ctg aac gee tta cct aaa ctg gee gca aca cat cat tta gat aac 
He Leu Asn Ala Leu Pro Lys Leu Ala Ala Thr His His Leu Asp Asn 
450 455 460 

gaa caa gtg tat aaa ttc ctg agt gec aaa aat aga acg tec age cat 
Glu Gin Val Tyr Lys Phe Leu Ser Ala Lys Asn Arg Thr Ser Ser His 
465 470 475 480 

gtt tta tac cat gtt atg gcg aat ggt gat gec gac atg ctg aaa att 
Val Leu Tyr His Val Met Ala Asn Gly Asp Ala Asp Met Leu Lys He 
485 490 495 



gaa cag gta etc gat etc ctg aag gca aag gat ttt tat ggt tgc cca 
Glu Gin Val Leu Asp Leu Leu Lys Ala Lys Asp Phe Tyr Gly Cys Pro 
515 520 525 



tec gat att gta gat ctt ctg acc get aaa agt ctt gcg cgc gac acg 
Ser Asp He Val Asp Leu Leu Thr Ala Lys Ser Leu Ala Arg Asp Thr 
565 570 575 



1200 



att ttc aat tea ttg tea gaa aca gga tat gaa gga tta etc teg aaa 1248 

He Phe Asn Ser Leu Ser Glu Thr Gly Tyr Glu Gly Leu Leu Ser Lys 
405 410 415 

aaa aat etc atg cat att ctg gag gca aaa gat aaa aat ggt ttt tct 1296 

Lys Asn Leu Met His He Leu Glu Ala Lys Asp Lys Asn Gly Phe Ser 
420 425 430 



134< 



1392 



1440 



1488 



gtt ttg aac gcg tta cct ttg tta att cgc aca tgt cat ttg act aaa 1536 
Val Leu Asn Ala Leu Pro Leu Leu He Arg Thr Cys His Leu Thr Lys 
500 505 510 



1584 



gga eta tat ctg gcg atg caa aat gga cat age gat ate gtg aaa gtt 1632 

Gly Leu Tyr Leu Ala Met Gin Asn Gly His Ser Asp He Val Lys Val 
530 535 540 

att ctt gaa gca ttg ccc age eta gec cag gaa att aac att tea get 1680 

He Leu Glu Ala Leu Pro Ser Leu Ala Gin Glu He Asn He Ser Ala 
545 550 555 560 



1728 



ggt ttg ttt atg gec atg cag cgc gga cac atg aac gtt att aat act 177 6 
Gly Leu Phe Met Ala Met Gin Arg Gly His Met Asn Val He Asn Thr 
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580 585 590 

att ttt aac gca tta ccc act ctg ttt aat acg ttt aaa ttc gat aaa 

He Phe Asn Ala Leu Pro Thr Leu Phe Asn Thr Phe Lys Phe Asp Lys 
595 600 605 

aaa aat atg aag ccc etc etc ctg gca aat aat tct aat gaa tat ccc 

Lys Asn Met Lys Pro Leu Leu Leu Ala Asn Asn Ser Asn Glu Tyr Pro 

610 615 620 

ggt ttg ttt tea gcg ata cag cat aaa caa caa aat gtt gta gag acg 

Gly Leu Phe Ser Ala He Gin His Lys Gin Gin Asn Val Val Glu Thr 
625 630 635 640 

gtt tat ctt get tta tct gac cat gca cgc ctg ttt gga ttt acc get 

Val Tyr Leu Ala Leu Ser Asp His Ala Arg Leu Phe Gly Phe Thr Ala 

'645 650 655 



1824 



1872 



1920 



1968 



gaa gat att atg gat ttt tgg caa cac aaa gec cca caa aaa tac tct 2016 
Glu Asp He Met Asp Phe Trp Gin His Lys Ala Pro Gin Lys Tyr Ser 
660 665 670 

gee ttt gag ttg get ttt gaa ttt ggt cac egg gtt att get gaa tta 2064 
Ala Phe Glu Leu Ala Phe Glu Phe Gly His Arg Val He Ala Glu Leu 
675 680 685 

ate ctt aat aca tta aat aag atg get gaa age ttt ggc ttt acg gat 2112 
He Leu Asn Thr Leu Asn Lys Met Ala Glu Ser Phe Gly Phe Thr Asp 
690 695 700 

aac cct cga tac att gcg gag aaa aat tat atg gaa get tta etc aaa 
Asn Pro Arg Tyr He Ala Glu Lys Asn Tyr Met Glu Ala Leu Leu Lys 
705 ' 710 715 720 

aaa gca tct ccc cat acc gta cgc taa 
Lys Ala Ser Pro His Thr Val Arg * 
725 



2160 



2187 



<210> 130 
<211> 255 
<212> DNA 

<213> Escherichia coli 

<220> 

<221> CDS 

<222> (1) . . . (255) 

<400> 130 

atg acc gat aaa ate cgt act ctg caa ggt cgc gtt gtt age gac aaa 

Met Thr Asp Lys He Arg Thr Leu Gin Gly Arg Val Val Ser Asp Lys 
1 5 10 ,15 

atg gag aaa tec att gtt gtt get ate gaa cgt ttt gtg aaa cac ccg 
Met Glu Lys Ser He Val Val Ala He Glu Arg Phe Val Lys His Pro 
20 25 30 

ate tac ggt aaa ttc ate aag cgt acg acc aaa ctg cac gta cat gac 
He Tyr Gly Lys Phe He Lys Arg Thr Thr Lys Leu His Val His Asp 
35 40 • 45 



48 



96 



144 
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gag aac aac gaa tgc ggt ate ggt gac 
Glu Asn Asn Glu Cys Gly lie Gly Asp 
50 55 

cgt ccg ctg tec aag act aaa tec tgg 
Arg Pro Leu Ser Lys Thr Lys Ser Trp 
65 70 

aaa gcg gtt ctg taa 
Lys Ala Val Leu * 



gtg gtt gaa ate egc gaa tgc 192 
Val Val Glu He Arg Glu Cys 
60 

acg ctg gtt egc gtt gta gag 240 
Thr Leu Val Arg Val Val Glu 
75 80 

255 



<210> 131 
<211> 192 
<212> DNA 

<213> Escherichia coli 

<220> 

<221> CDS 

<222> (1) . . . (192) 

<400> 131 

atg aaa gca aaa gag ctg cgt gag aag age gtt gaa gag ctg aac ace 4 8 

Met Lys Ala Lys Glu Leu Arg Glu Lys Ser Val Glu Glu Leu Asn Thr 
1 5 10 15 

gag ctg ctg aac ctg ctg cgt gag cag ttc aac ctg cgt atg cag get 96 
Glu Leu Leu Asn Leu Leu Arg Glu Gin Phe Asn Leu Arg Met Gin Ala 
20 25 30 

gca agt ggc cag ctg caa cag tct cac ctg ttg aag caa gtg cgt egc 144 
Ala Ser Gly Gin Leu Gin Gin Ser His Leu Leu Lys Gin Val Arg Arg 
35 40 45 

gat gtc gca egc gtt aag act tta ctg aac gag aag gcg ggt gcg taa 192 
Asp Val Ala Arg Val Lys Thr Leu Leu Asn Glu Lys Ala Gly Ala * 
50 55 60 



<210> 132 
<211> 411 
<212> DNA 

<213> Escherichia coli 

<220> 

<221> CDS 

<222> (1) . - . (411) 

<400> 132 

atg tta caa cca aag cgt aca aaa ttc cgt aaa atg cac aaa ggc cgt 48 

Met Leu Gin Pro Lys Arg Thr Lys Phe Arg Lys Met His Lys Gly Arg 

15 10 15 

aac egc ggt ctg gcg cag ggt acg gat gtt age ttc ggc age ttc ggt 96 
Asn Arg Gly Leu Ala Gin Gly Thr Asp Val Ser Phe Gly Ser Phe Gly 
20 25 30 

ctg aaa get gtt ggc cgt ggt cgt ctg act gee cgt cag ate gaa gca 144 
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Leu Lys Ala Val Gly Arg Gly Arg Leu Thr Ala Arg Gin lie Glu Ala 
35 40 45 

gca cgt cgt get atg acc cgt gca gtt aag cgt caa ggt aag ate tgg 192 
Ala Arg Arg Ala Met Thr Arg Ala Val Lys Arg Gin Gly Lys lie Trp 
50 55 60 

ate cgt gtg ttc ccg gac aaa ccg ate act gaa aag ccg ctg gca gtg 240 
lie Arg Val Phe Pro Asp Lys Pro lie Thr Glu Lys Pro Leu Ala Val 
65 ' 70 75 80 

cgt atg ggt aaa ggt aaa ggt aac gtg gag tat tgg gtt gec ttg att 288 
Arg Met Gly Lys Gly Lys Gly Asn Val Glu Tyr Trp Val Ala Leu He 
85 90 95 

cag ccg ggt aaa gtc ctg tat gaa atg gac ggt gtt ccg gaa gag ctg 336 
Gin Pro Gly Lys Val Leu Tyr Glu Met Asp Gly Val Pro Glu Glu Leu 
100 105 110 



gee cgt gaa gca ttc aag ctg gca gca gcg aaa ctg ccg att aaa acc 
Ala Arg Glu Ala Phe Lys Leu Ala Ala Ala Lys Leu Pro He Lys Thr 
115 120 125 



cca tgg aac tct acc tgg ttt gcg aac acc aaa gaa ttc get gac aac 
Pro Trp Asn Ser Thr Trp Phe Ala Asn Thr Lys Glu Phe Ala Asp Asn 
20 ~ 25 30 



aaa gcg tec gta tct cgt ate gtt ate gag cgt ccg get aag age ate 
Lys Ala Ser Val Ser Arg He Val He Glu Arg Pro Ala Lys Ser He 
50 55 60 

cgt gta acc att cac act get cgc ccg ggt ate gtt ate ggt aaa aaa 
Arg Val Thr He His Thr Ala Arg Pro Gly He Val He Gly Lys Lys 
65 70 75 80 

ggt gaa gac gta gaa aaa ctg cgt aag qtc gta gcg gac ate get ggc 
Gly Glu Asp Val Glu Lys Leu Arg Lys Val Val Ala Asp He Ala Gly 



384 



acc ttt gta act aag acg gtg atg taa 411 
Thr Phe Val Thr Lys Thr Val Met * 
130 135 



<210> 133 
<211> 702 
<212> DNA 

<213> Escherichia coli 

<220> 

<221> CDS 

<222> (1) . . . (702) 

<400> 133 

atg ggt cag aaa gta cat cct aat ggt att cgc ctg ggt att gta aaa 48 

Met Gly Gin Lys Val His Pro Asn Gly He Arg Leu Gly He Val Lys 
1 " 5 10 15 



96 



ctg gac age gat ttt aaa gta cgt cag tac ctg act aag gaa ctg get 144 
Leu Asp Ser Asp Phe Lys Val Arg Gin Tyr Leu Thr Lys Glu Leu Ala 
35 40 45 



192 



240 



288 



-35- 



WO 01/34810 



PCT/US00/30950 



85 90 95 

gtt cct gca cag ate aac ate gec gaa gtt cgt aag cct gaa ctg gac 336 
Val Pro Ala Gin He Asn He Ala Glu Val Arg Lys Pro Glu Leu Asp 
100 105 110 



gca aaa ctg gtt get gac age ate act tct cag ctg gaa cgt cgc gtt 
Ala Lys Leu Val Ala Asp Ser He Thr Ser Gin Leu Glu Arg Arg Val 
115 120 125 



384 



atg ttc cgt cgt get atg aag cgt get gta cag aac gca atg cgt ctg 432 
Met Phe Arg Arg Ala Met Lys Arg Ala Val Gin Asn Ala Met Arg Leu 
130 135 140 

ggc get aaa ggt att aaa gtt gaa gtt age ggc cgt ctg ggc ggc gcg 480 
Gly Ala Lys Gly He Lys Val Glu Val Ser Gly Arg Leu Gly Gly Ala 
145 150 155 160 

gaa ate gca cgt acc gaa tgg tac cgc gaa ggt cgc gta ccg ctg cac 528 
Glu He Ala Arg Thr Glu Trp Tyr Arg Glu Gly Arg Val Pro Leu His 
165 170 175 

act ctg cgt get gac ate gac tac aac acc tct gaa gcg cac acc act 57 6 

Thr Leu Arg Ala Asp He Asp Tyr Asn Thr Ser Glu Ala His Thr Thr 
180 " 185 190 

tac ggt gta ate ggc gtt aaa gtg tgg ate ttc aaa ggc gag ate ctg 624 
Tyr Gly Val He Gly Val Lys Val Trp He Phe Lys Gly Glu He Leu 
195 " 200 205 

ggt ggt atg get get gtt gaa caa ccg gaa aaa ccg get get cag cet 672 
Gly Gly Met Ala Ala Val Glu Gin Pro Glu Lys Pro Ala Ala Gin Pro 
210 215 220 

aaa aag cag cag cgt aaa ggc cgt aaa taa 7 02 

Lys Lys Gin Gin Arg Lys Gly Arg Lys * 
225 " 230 



<210> 134 
<211> 333 
<212> DNA 

<213> Escherichia coli 

<220> 

<221> CDS 

<222> (1) . . . (333) 

<400> 134 

atg gaa act ate get aaa cat cgc cat get cgt tct tct get cag aag 48 

Met Glu Thr He Ala Lys His Arg His Ala Arg Ser Ser Ala Gin Lys 
1 5 10 15 

gtt cgc ctt gtt get gac ctg att cgc ggt aag aaa gtg teg cag get 96 
Val Arg Leu Val Ala Asp Leu He Arg Gly Lys Lys Val Ser Gin Ala 
20 25 30 

ctg gat att ttg acc tac acc aac aag aaa gcg get gta ctg gtc aag 144 
Leu Asp He Leu Thr Tyr Thr Asn Lys Lys Ala Ala Val Leu Val Lys 
35 40 45 
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aaa gtt ctg gaa tct gcc att get aac get gaa cac aac gat ggc get 192 
Lys Val Leu Glu Ser Ala lie Ala Asn Ala Glu His Asn Asp Gly Ala 
50 55 60 

gac att gac gat ctg aaa gtt acg aaa att ttc gta gac gaa ggc ccg 240 
Asp lie Asp Asp Leu Lys Val Thr Lys He Phe Val Asp Glu Gly Pro 
65 ™ 7 5 80 

age atg aag cgc att atg ccg cgt gca aaa ggt cgt gca gat cgc ate 
Ser Met Lys Arg He Met Pro Arg Ala Lys Gly Arg Ala Asp Arg He 
85 90 95 

ctg aag cgc ace age cac ate act gtg gtt gtg tec gat cgc tga 333 
Leu Lys Arg Thr Ser His He Thr Val Val Val Ser Asp Arg * 
100 105 110 



288 



<210> 135 
<211> 279 
<212> DNA 

<213> Escherichia coli 

<220> 

<221> CDS 

<222> (1) . . . (279) 

<400> 135 

atg cca cgt tct etc aag aaa ggt cct ttt att gac ctg cac ttg ctg 4 8 

Met Pro Arg Ser Leu Lys Lys Gly Pro Phe He Asp Leu His Leu Leu 
1 5 ~ 10 15 

aag aag gta gag aaa gcg gtg gaa age gga gac aag aag ccc ctg cgc 96 
Lys Lys Val Glu Lys Ala Val Glu Ser Gly Asp Lys Lys Pro Leu Arg 
20 25 30 

act tgg tec cgt cgt tea acg ate ttt cct aac atg ate ggt ttg acc 144 
Thr Trp Ser Arg Arg Ser Thr He Phe Pro Asn Met He Gly Leu Thr 
35 40 45 

ate get gtc cat aat ggt cgt cag cac gtt ccg gta ttt gta acc gac 192 
He Ala Val His Asn Gly Arg Gin His Val Pro Val Phe Val Thr Asp 
50 55 60 

gaa atg gtt ggt cac aaa ctg ggt gaa ttc gca ccg act cgt act tat 240 
Glu Met Val Gly His Lys Leu Gly Glu Phe Ala Pro Thr Arg Thr Tyr 
65 70 75 80 

cgc ggc cac get get gat aaa aaa gcg aag aag aaa taa 279 
Arg Gly His Ala Ala Asp Lys Lys Ala Lys Lys Lys * 
85 90 



<210> 136 
<211> 822 
<212> DNA 

<213> Escherichia coli 

<220> 
<221> CDS 
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<222> (1) . (822) 
<400> 136 

atg gca gtt gtt aaa tgt aaa ccg aca tct ccg ggt cgt cgc cac gta 
Met Ala Val Val Lys Cys Lys Pro Thr Ser Pro Gly Arg Arg His Val 
15 10 15 

gtt aaa gtg gtt aac cct gag ctg cac aag ggc aaa cct ttt get ccg 
Val Lys Val Val Asn Pro Glu Leu His Lys Gly Lys Pro Phe Ala Pro 
20 25 30 

ttg ctg gaa aaa aac age aaa tec ggt ggt cgt aac aac aat ggc cgt 
Leu Leu Glu Lys Asn Ser Lys Ser Gly Gly Arg Asn Asn Asn Gly Arg 
35 40 45 

ate acc act cgt cat ate ggt ggt ggc cac aag cag get tac cgt att 
lie Thr Thr Arg His lie Gly Gly Gly His Lys Gin Ala Tyr Arg lie 
50 55 60 

gtt gac ttc aaa cgc aac aaa gac ggt ate ccg gca gtt gtt gaa cgt 
Val Asp Phe Lys Arg Asn Lys Asp Gly lie Pro Ala Val Val Glu Arg 
65 " 70 75 80 

ctt gag tac gat ccg aac cgt tec gcg aac ate gcg ctg gtt ctg tac 
Leu Glu Tyr Asp Pro Asn Arg Ser Ala Asn lie Ala Leu Val Leu Tyr 
85 90 95 



ggc gac cag att cag tct ggc gtt gat get gca ate aaa cca ggt aac 
Gly Asp Gin lie Gin Ser Gly Val Asp Ala Ala lie Lys Pro Gly Asn 
, 115 120 125 



gaa atg aaa cca ggt aaa ggc ggt cag ctg gca cgt tec get ggt act 
Glu Met Lys Pro Gly Lys Gly Gly Gin Leu Ala Arg Ser Ala Gly Thr 
145 * 150 155 160 



ggc gaa gtt ggc aat get gag cat atg ctg cgc gtt ctg ggt aaa gca 
Gly Glu Val Gly Asn Ala Glu His Met Leu Arg Val Leu Gly Lys Ala 
195 200 205 



atg aac ccg gta gac cac cca cat ggt ggt ggt gaa ggt cgt aac ttt 

-38- 



48 



96 



144 



192 



240 



288 



aaa gac ggt gaa cgc cgt tac ate ctg gec cct aaa ggc ctg aaa get 336 
Lys Asp Gly Glu Arg Arg Tyr He Leu Ala Pro Lys Gly Leu Lys Ala 
100 105 HO 



384 



acc ctg ccg atg cgc aac ate ccg gtt ggt tct act gtt cat aac gta 432 
Thr Leu Pro Met Arg Asn He Pro Val Gly Ser Thr Val His Asn Val 
130 135 140 



480 



tac gtt cag ate gtt get cgt gat ggt get tat gtc acc ctg cgt ctg 528 
Tyr Val Gin He Val Ala Arg Asp Gly Ala Tyr Val Thr Leu Arg Leu 
165 170 175 

cgt tct ggt gaa atg cgt aaa gta gaa gca gac tgc cgt gca act ctg 576 
Arg Ser Gly Glu Met Arg Lys Val Glu Ala Asp Cys Arg Ala Thr Leu 
180 185 190 



624 



ggt get gca cgc tgg cgt ggt gtt cgt ccg acc gtt cgc ggt acc gcg 672 
Gly Ala Ala Arg Trp Arg Gly Val Arg Pro Thr Val Arg Gly Thr Ala 
210 215 220 



720 
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Met Asn Pro Val Asp His Pro His Gly Gly Gly Giu Gly Arg Asn Phe 
225 230 235 240 

ggt aag cac ccg gta act ccg tgg ggc gtt cag acc aaa ggt aag aag 
Gly Lys His Pro Val Thr Pro Trp Gly Val Gin Thr Lys Gly Lys Lys 
245 250 255 

acc cgc age aac aag cgt act gat aaa ttc ate gta cgt cgc cgt age 
Thr Arg Ser Asn Lys Arg Thr Asp Lys Phe He Val Arg Arg Arg Ser 
260 265 270 

aaa taa 
Lys * 



768 



816 



822 



<210> 137 
<211> 303 
<212> DNA 

<213> Escherichia coli 

<220> 

<221> CDS 

<222> (1) . . . (303) 

<400> 137 

atg att cgt gaa gaa cgt ctg ctg aag gtg ctg cgt gca ccg cac gtt 

Met He Arg Glu Glu Arg Leu Leu Lys Val Leu Arg Ala Pro His Val 
1 5 10 15 

tct gaa aaa gcg tct act gcg atg gaa aaa tec aac acc ate gta etc 
Ser Glu Lys Ala Ser Thr Ala Met Glu Lys Ser Asn Thr He Val Leu 
20 25 30 

aaa gtt get aaa gac gcg acc aaa gca gaa ate aaa get get gtg cag 
Lys Val Ala Lys Asp Ala Thr Lys Ala Glu He Lys Ala Ala Val Gin 
35 40 4 5 

aaa ctg ttt gaa gtc gaa gtc gaa gtc gtt aac acc ctg gta gtt aaa 
Lys Leu Phe Glu Val Glu Val Glu Val Val Asn Thr Leu Val Val Lys 
50 55 60 

ggg aaa gtt aaa cgt cac gga cag cgt ate ggt cgt cgt age gac tgg 
Gly Lys Val Lys Arg His Gly Gin Arg He Gly Arg Arg Ser Asp Trp 
65 70 75 80 

aaa aaa get tac gtc acc ctg aaa gaa ggc cag aat ctg gac ttc gtt 
Lys Lys Ala Tyr Val Thr Leu Lys Glu Gly Gin Asn Leu Asp Phe Val 
85 90 95 

ggc ggc get gag taa 
Gly Gly Ala Glu * 
100 



48 



96 



144 



192 



240 



288 



303 



<210> 138 
<211> 606 
<212> DNA 

<213> Escherichia coli 
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<220> 

<221> CDS 

<222> (1) . . . (606) 

<400> 138 

atg gaa tta gta ttg aaa gac gcg cag age gcg ctg act gtt tec gaa 48 

Met Glu Leu Val Leu Lys Asp Ala Gin Ser Ala Leu Thr Val Ser Glu 

15 10 15 

act acc ttc ggt cgt gat ttc aac gaa gcg ctg gtt cac cag gtt gtt 96 
Thr Thr Phe Gly Arg Asp Phe Asn Glu Ala Leu Val His Gin Val Val 
20 25 30 



gtt get tat gca get ggt get cgt cag ggt act cgt get cag aag act 

Val Ala Tyr Ala Ala Gly Ala Arg Gin Gly Thr Arg Ala Gin Lys Thr 

35 40 45 

cgt get gaa gta act ggt tec ggt aaa aaa ccg tgg cgc cag aaa ggc 

Arg Ala Glu Val Thr Gly Ser Gly Lys Lys Pro Trp Arg Gin Lys Gly 

50 55 60 

acc ggc cgt gcg cgt tct ggt tct ate aag age ccg ate tgg cgt tct 

Thr Gly Arg Ala Arg Ser Gly Ser He Lys Ser Pro lie Trp Arg Ser 

65 70 75 80 



aac aag aag atg tac cgc ggc gcg ctg aaa age ate ctg tec gaa ctg 

Asn Lys Lys Met Tyr Arg Gly Ala Leu Lys Ser He Leu Ser Glu Leu 
100 105 110 

gta cgt cag gat cgt ctg ate gtt gtc gag aag ttc tct gta gaa gcg 

Val Arg Gin Asp Arg Leu He Val Val Glu Lys Phe Ser Val Glu Ala 
115 " 120 125 

ccg aaa act aag ctg ctg gca cag aaa ctg aaa gac atg get ctg gaa 

Pro Lys Thr Lys Leu Leu Ala Gin Lys Leu Lys Asp Met Ala Leu Glu 
130 * 135 140 

gat gtg ctg ate ate acc ggt gag ctg gac gaa aac ctg ttc ctg get 

Asp Val Leu He He Thr Gly Glu Leu Asp Glu Asn Leu Phe Leu Ala 

145 150 155 160 

gcg cgc aac ctg cac aag gtt gac gta cgc gat gca act ggt ate gac 
Ala Arg Asn Leu His Lys Val Asp Val Arg Asp Ala Thr Gly He Asp 

165 170 175 



144 



192 



240 



ggt ggc gtg acc ttt get get cgt ccg cag gac cac agt caa aaa gtt 288 
Gly Gly Val Thr Phe Ala Ala Arg Pro Gin Asp His Ser Gin Lys Val 
85 90 95 



336 



384 



432 



480 



528 



606 



ccg gtt age ctg ate gee ttc gac aaa gtc gta atg act get gat get 576 

Pro Val Ser Leu He Ala Phe Asp Lys Val Val Met Thr Ala Asp Ala 
180 185 190 

gtt aag caa gtt gag gag atg ctg gca tga 

Val Lys Gin Val Glu Glu Met Leu Ala * 
195 200 



<210> 139 
<211> 630 
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<212> DNA 

<213> Escherichia coli 

<220> 

<221> CDS 

<222> (1) . . . (639) 

<400> 139 

atg att ggt tta gtc ggt aaa aaa gtg ggt atg acc cgt ate ttc aca 

Met lie Gly Leu Val Gly Lys Lys Val Gly Met Thr Arg lie Phe Thr 
1 5 10 15 

gaa gac ggc gtt tct ate cca gta acc gta ate gaa gtt gaa gca aac 
Glu Asp Gly Val Ser He Pro Val Thr Val He Glu Val Glu Ala Asn 
20 25 30 

cgc gtt act cag gtt aaa gac ctg get aac gat ggc tac cgt get att 
Arg Val Thr Gin Val Lys Asp Leu Ala Asn Asp Gly Tyr Arg Ala He 
35 40 45 

cag gtg acc acc ggt get aaa aaa get aac cgt gtg acc aag cct gaa 
Gin Val Thr Thr Gly Ala Lys Lys Ala Asn Arg Val Thr Lys Pro Glu 
50 55 60 

get ggc cac ttc get aaa get ggc gta gaa get ggc cgt ggt ctg tgg 
Ala Gly His Phe Ala Lys Ala Gly Val Glu Ala Gly Arg Gly Leu Trp 
65 70 75 80 

gaa ttc cgc ctg get gaa ggc gaa gag ttc act gta ggt cag age att 
Glu Phe Arg Leu Ala Glu Gly Glu Glu Phe Thr Val Gly Gin Ser He 
85 90 95 

age gtt gaa ctg ttt get gac gtt aaa aaa gtt gac gta act ggc acc 
Ser Val Glu Leu Phe Ala Asp Val Lys Lys Val Asp Val Thr Gly Thr 
100 105 HO 

tct aaa ggt aaa ggt ttc gca ggt acc gtt aag cgc tgg aac tec cgt 
Ser Lys Gly Lys Gly Phe Ala Gly Thr Val Lys Arg Trp Asn Phe Arg 
115 120 125 



tct ate ggt cag aac cag act ccg ggc aaa gtg ttc aaa ggc aag aaa 
Ser He Gly Gin Asn Gin Thr Pro Gly Lys Val Phe Lys Gly Lys Lys 
145 150 155 160 

atg gca ggt cag atg ggt aac gaa cgt gta acc gtt cag age ctt gac 
Met Ala Gly Gin Met Gly Asn Glu Arg Val Thr Val Gin Ser Leu Asp 
165 170 175 



gtc ccg ggt gca acc ggt age gac ctg ate gtt aaa cca get gtg aag 
Val Pro Gly Ala Thr Gly Ser Asp Leu He Val Lys Pro Ala Val Lys 
195 200 205 



48 



96 



144 



192 



240 



288 



336 



384 



acc cag gac get act cac ggt aac tec ttg tct cac cgc gtt ccg ggt 4 32 

Thr Gin Asp Ala Thr His Gly Asn Ser Leu Ser His Arg Val Pro Gly 
130 135 140 



480 



528 



gta gta cgc gtt gac get gag cgc aac ctg ctg ctg gtt aaa ggt get 57 6 

Val Val Arg Val Asp Ala Glu Arg Asn Leu Leu Leu Val Lys Gly Ala 
180 185 190 



624 
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gcg taa 630 
Ala * * * 



<210> 140 
<211> 312 
<212> DNA 

<213> Escherichia coli 

<220> 

<221> CDS 

<222> (1) . . . (312) 

<400> 140 

atg cag aac caa aga ate cgt ate cgc ctg aaa gcg ttt gat cat cgt 

Met Gin Asn Gin Arg He Arg He Arg Leu Lys Ala Phe Asp His Arg 
I 5 10 15 

ctg ate gat caa gca aec gcg gaa ate gtc gag act gee aag cgc act 
Leu He Asp Gin Ala Thr Ala Glu He Val Glu Thr Ala Lys Arg Thr 
20 25 30 



48 



96 



ggt gcg cag gtc cgt ggt ccg ate ccg ctg ccg aca cgc aaa gag cgc 14 4 

Gly Ala Gin Val Arg Gly Pro He Pro Leu Pro Thr Arg Lys Glu Arg 

35 " 40 45 

ttc act gtt ctg ate tec ccg cac gtc aac aaa gac gcg cgc gat cag 192 

Phe Thr Val Leu He Ser Pro His Val Asn Lys Asp Ala Arg Asp Gin 
50 55 60 

tac gaa ate cgt act cac ttg cgt ctg gtt gac ate gtt gag cca acc 

Tyr Glu He Arg Thr His Leu Arg Leu Val Asp He Val Glu Pro Thr 
65 70 75 80 

gag aaa acc gtt gat get ctg atg cgt ctg gat ctg get gec ggt gta 

Glu Lys Thr Val Asp Ala Leu Met Arg Leu Asp Leu Ala Ala Gly Val 
85 90 95 

gac gtg cag ate age ctg ggt taa 312 
Asp Val Gin He Ser Leu Gly * 
100 



240 



288 



<210> 141 
<211> 4710 
<212> DNA 

<213> Escherichia coli 

<220> 

<221> CDS 

<222> (1) ... (4710) 

<400> 141 

gtg aac aca ata cac ttg cgc tgt etc ttc agg atg aat ccc ctg gtc 
Met Asn Thr He His Leu Arg Cys Leu Phe Arg Met Asn Pro Leu Val 
15 10 15 

tgg tgc ctg tgg get gat gtt gca gca aag eta agg teg ctt aaa cgc 
Trp Cys Leu Trp Ala Asp Val Ala Ala Lys Leu Arg Ser Leu Lys Arg 
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20 25 30 

tac tea gta ttc act ttt cag agg atg aaa ttt atg aac agg acc agt 
Tyr Ser Val Phe Thr Phe Gin Arg Met Lys Phe Met Asn Arg Thr Ser 
35 40 45 

ccc tat tat tgt cgc cgc tea gta ctt tec tta ttg ata tct gee ttg 
Pro Tyr Tyr Cys Arg Arg Ser Val Leu Ser Leu Leu He Ser Ala Leu 
50 " 55 60 

ata tat gee ccg ccc ggg atg get gec ttc act act aat gtt a t tt ggt 
He Tyr Ala Pro Pro Gly Met Ala Ala Phe Thr Thr Asn Val He Gly 
65 70 75 80 

gtg gta aac gat gag act gta gat ggc aac caa aaa gtg gat gaa cga 
Val Val Asn Asp Glu Thr Val Asp Gly Asn Gin Lys Val Asp Glu Arg 
85 90 95 

ggt aca aca aat aac act cat att ate aac cat ggc cag cag aat gtt 
Gly Thr Thr Asn Asn Thr His He He Asn His Gly Gin Gin Asn Val 
100 105 HO 

cat ggc ggg gta tct aat gga agt ctt att gaa tct ggt gga tat caa 
His Gly Gly Val Ser Asn Gly Ser Leu He Glu Ser Gly Gly Tyr Gin 
115 120 125 



gcg aat gga ata etc att gat ggt ggt age cag ata gta aaa gtt caa 
Ala Asn Gly He Leu Tie Asp Gly Gly Ser Gin lie Val Lys Val Gin 
195 200 205 



144 



192 



240 



288 



336 



384 



gat ata gga agt cat aac aat ttt gtg ggg cag get aat aat aca acc 4 32 

Asp He Gly Ser His Asn Asn Phe Val Gly Gin Ala Asn Asn Thr Thr 
130 135 140 



480 



528 



att aac ggt ggc aga cag tea att cat gac ggg ggt att tec aca ggt 
He Asn Gly Gly Arg Gin Ser He His Asp Gly Gly He Ser Thr Gly 
145 150 155 160 

acg aca ate gag agt ggc aat cag gac gtt tat aaa ggg ggt ate age 
Thr Thr He Glu Ser Gly Asn Gin Asp Val Tyr Lys Gly Gly He Ser 
165 170 175 

aat gga acg aca att aag ggc ggt get tea cgc gta gag gga ggg agt 57 6 

Asn Gly Thr Thr He Lys Gly Gly Ala Ser Arg Val Glu Gly Gly Ser 
180 185 190 



624 



ggg cat get gat ggt aca acg ata aat aag tct ggc tct cag gac gta 672 

Gly His Ala Asp Gly Thr Thr He Asn Lys Ser Gly Ser Gin Asp Val 
210 ' 215 220 

gta caa gga agt ctg gca acg aac aca acc ata aat ggt ggt cga cag 720 

Val Gin Gly Ser Leu Ala Thr Asn Thr Thr He Asn Gly Gly Arg Gin 
225 ' 230 235 240 

tat gtt gaa cag age aca gta gaa aca aca acc att aaa aat ggc ggt 

Tyr Val Glu Gin Ser Thr Val Glu Thr Thr Thr He Lys Asn Gly Gly 
245 250 255 

gag caa aga gta tat gag age cgt gcg ctg gac acg acg att gaa ggc 
Glu Gin Arg Val Tyr Glu Ser Arg Ala Leu Asp Thr Thr He Glu Gly 



768 



816 
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260 265 270 

gga act cag tct ctg aat agt aag tea acg gca aaa aat acg cat ate 864 
Gly Thr Gin Ser Leu Asn Ser Lys Ser Thr Ala Lys Asn Thr His lie 
275 280 285 

tat tct ggt ggc acg caa att gtt gat aac acc age ace teg gat gtt 912 
Tyr Ser Gly Gly Thr Gin lie Val Asp Asn Thr Ser Thr Ser Asp Val 
290 295 300 

att gaa gtt tat tct ggt ggc gtg ctt gat gtt agg ggt ggt acg gca 960 
lie Glu Val Tyr Ser Gly Gly Val Leu Asp Val Arg Gly Gly Thr Ala 
305 310 315 320 

aca aat gtt acc cag cac gat ggt gca att tta aaa act aac act aac 1008 
Thr Asn Val Thr Gin His Asp Gly Ala Tie Leu Lys Thr Asn Thr Asn 
325 330 335 

ggt acg acg gtg age ggt acg aat agt gaa ggt gca ttc tec ate cac 1056 
Gly Thr Thr Val Ser Gly Thr Asn Ser Glu Gly Ala Phe Ser lie His 
340 345 350 

aat cac gtg gca gac aat gtg ttg ctg gaa aac ggt ggt cat tta gac 1104 
Asn His Val Ala Asp Asn Val Leu Leu Glu Asn Gly Gly His Leu Asp 
355 360 365 

ata aac gca tat ggt teg gca aac aag acg att att aaa gat aaa gga 1152 
lie Asn Ala Tyr Gly Ser Ala Asn Lys Thr He He Lys Asp Lys Gly 
370 375 380 

aca atg tea gtt tta acc aat get aaa get gat gcg acc cga ata gat 1200 
Thr Met Ser Val Leu Thr Asn Ala Lys Ala Asp Ala Thr Arg He Asp 
385 390 395 400 

aat ggc ggg gtt atg gat gtt gca gga aac gcg aca aat acc ata att 1248 
Asn Gly Gly Val Met Asp Val Ala Gly Asn Ala Thr Asn Thr He He 
405 410 415 

aat ggt ggc aca cag aat att aat aat tat ggc ata gec aca ggc acc 1296 
Asn Gly Gly Thr Gin Asn He Asn Asn Tyr Gly He Ala Thr Gly Thr 
420 425 430 

aat ate aac age gga acg caa aat ate aaa age ggc ggg aaa get gac 1344 
Asn He Asn Ser Gly Thr Gin Asn He Lys Ser Gly Gly Lys Ala Asp 
435 440 445 

aca aca att ata tec tec ggg age egg cag gtt gtt gag aaa gat ggt 1392 
Thr Thr He He Ser Ser Gly Ser Arg Gin Val Val Glu Lys Asp Gly 
450 455 460 

acg gca att ggc age aat att age gec gga ggc teg ctg att gtc tat 14 40 
Thr Ala He Gly Ser Asn He Ser Ala Gly Gly Ser Leu lie Val Tyr 
465 * 470 475 480 

acc ggc ggt att gca cat ggg gtt aac cag gag acg ggc agt get tta 14 88 
Thr Gly Gly He Ala His Gly Val Asn Gin Glu Thr Gly Ser Ala Leu 
485 490 495 

gtt gec aac acg ggt gca ggg act gat ate gaa gga tac aac aag etc 1536 
Val Ala Asn Thr Gly Ala Gly Thr Asp He Glu Gly Tyr Asn Lys Leu 



-44- 



WO 01/34810 



PCT/USOO/30950 



500 505 510 

tct cac ttc act att acc gga ggg gag get aat tat gtt gtg ctg gaa 1584 
Ser His Phe Thr He Thr Gly Gly Glu Ala Asn Tyr Val Val Leu Glu - 
515 520 525 



aat acc ggc gaa ctg acg gta gtg get aaa acc teg gcg aaa aat act 
Asn Thr Gly Glu Leu Thr Val Val Ala Lys Thr Ser Ala Lys Asn Thr 
530 535 540 

acc att gat act ggc ggt aag ctg att gtc cag aag gag get aaa aca 
Thr He Asp Thr Gly Gly Lys Leu He Val Gin Lys Glu Ala Lys Thr 
545 550 555 560 



aca gag aaa aca cac ate aat ggt ggc acg caa acc gtt cag aat tat 
Thr Glu Lys Thr His He Asn Gly Gly Thr Gin Thr Val Gin Asn Tyr 
705 " 710 715 720 

ggt aag gca ate aat acc gat ate gtc tct ggc eta caa caa att atg 
Gly Lys Ala He Asn Thr Asp He Val Ser Gly Leu Gin Gin He Met 
725 730 735 

gca aac ggg aca gcg gaa ggt tec att att aat ggg ggt tea cag gta 
Ala Asn Gly Thr Ala Glu Gly Ser He He Asn Gly Gly Ser Gin Val 



1632 



1680 



gat age acc aga ctt aat aat ggc ggc gtt ctg gag gtt cag gac ggt 1728 
Asp Ser Thr Arg Leu Asn Asn Gly Gly Val Leu Glu Val Gin Asp Gly 
565 570 575 

ggt gag get aag cat gtt gag caa caa tec ggc ggc gca tta att get 1776 
Gly Glu Ala Lys His Val Glu Gin Gin Ser Gly Gly Ala Leu He Ala 
580 585 590 

tec acg acc tec gga aca ctt ate gaa gga acc aac agt tat ggt gat 1824 
Ser Thr Thr Ser Gly Thr Leu He Glu Gly Thr Asn Ser Tyr Gly Asp 
595 600 605 

get ttc tac ate agg aat tea gaa get aaa aat gta gtg ctg gaa aac 
Ala Phe Tyr He Arg Asn Ser Glu Ala Lys Asn Val Val Leu Glu Asn 
610 615 620 

get ggc tea tta aca gtc gtc act ggt tec cgc gca gtt gac acg att 
Ala Gly Ser Leu Thr Val Val Thr Gly Ser Arg Ala Val Asp Thr He 
625 630 635 640 

att aat gee aac ggc aaa atg gat gtt tat gga aaa gat gtt ggc act 
He Asn Ala Asn Gly Lys Met Asp Val Tyr Gly Lys Asp Val Gly Thr 
645 650 655 

gta etc aat agt get ggc acc caa aca ata tat gec agt gee act tct 
Val Leu Asn Ser Ala Gly Thr Gin Thr He Tyr Ala Ser Ala Thr Ser 
660 665 670 

gat aaa gca aat ate aaa ggt ggc aag caa acg gta tat ggt tta gee 
Asp Lys Ala Asn He Lys Gly Gly Lys Gin Thr Val Tyr Gly Leu Ala 
675 680 685 

act gaa gca aat ate gaa agt ggt gaa caa att gtt gat ggt ggg tea 2112 
Thr Glu Ala Asn He Glu Ser Gly Glu Gin He Val Asp Gly Gly Ser 
690 695 700 



1872 



1920 



1968 



2016 



2064 



2160 



2208 



2256 
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740 745 750 

gtt aat gag ggc ggc ctg get gaa aac teg gtg ctt aat gac ggc ggc 2304 
Val Asn Glu Gly Gly Leu Ala Glu Asn Ser Val Leu Asn Asp Gly Gly 
755 ' 760 765 

aca etc gat gtg egg gag aaa ggc age gca acg ggg ata cag cag agt 2352 
Thr Leu Asp Val Arg Glu Lys Gly Ser Ala Thr Gly lie Gin Gin Ser 
770 775 780 



age cag ggc get ttg gtt gca ace ace agg gcg acg egg gtc aca gga 
Ser Gin Gly Ala Leu Val Ala Thr Thr Arg Ala Thr Arg Val Thr Gly 
785 " 790 795 800 



2400 



aca cgc gcg gat ggc gtc gcg ttc age ate gag cag ggt gcg gcg aac 24 48 

Thr Arg Ala Asp Gly Val Ala Phe Ser He Glu Gin Gly Ala Ala Asn 
805 810 815 

aat ate ctg ctg gca aat ggc ggc gtg tta acc gtg gag tea gac ace 2496 
Asn He Leu Leu Ala Asn Gly Gly Val Leu Thr Val Glu Ser Asp Thr 
820 825 830 

tct tct gac aaa aca cag gtc aat atg ggc gga egg gag ate gtc aaa 2544 
Ser Ser Asp Lys Thr Gin Val Asn Met Gly Gly Arg Glu He Val Lys 
835 ' 840 845 

aca aaa gee act gcg aca ggc acg acg etc acc ggc ggt gaa caa att 2592 
Thr Lys Ala Thr Ala Thr Gly Thr Thr Leu Thr Gly Gly Glu Gin lie 
850 855 860 

gtc gag ggt gtg gcg aat gag aca aca att aac gac ggc gga ata caa 2640 
Val Glu Gly Val Ala Asn Glu Thr Thr He Asn Asp Gly Gly He Gin 
865 870 875 880 

aca gtt tea get aac gga gag gca ata aaa aca aag ate aat gaa ggc 2688 
Thr Val Ser Ala Asn Gly Glu Ala He Lys Thr Lys He Asn Glu Gly 
805 890 895 

ggt acg ctg aca gtc aac gat aat ggc aaa gcg aca gat ate gtc cag 2736 
Gly Thr Leu Thr Val Asn Asp Asn Gly Lys Ala Thr Asp He Val Gin 
900 905 910 

aac age ggt gee get etc cag acg age acg get aac ggt att gaa ate 2784 
Asn Ser Gly Ala Ala Leu Gin Thr Ser Thr Ala Asn Gly He Glu He 
915 920 925 

age ggt act cac cag tac ggt act ttt tec att tec ggc aat tta gcg 2832 
Ser Gly Thr His Gin Tyr Gly Thr Phe Ser He Ser Gly Asn Leu Ala 
930 935 940 

acc aat atg ttg ctg gaa aat ggc ggt aat tta ttg gta tta gca ggt 2880 
Thr Asn Met Leu Leu Glu Asn Gly Gly Asn Leu Leu Val Leu Ala Gly 
945 950 955 960 

acc gaa get cgc gac tec acg gtt ggc aag ggt ggg gca atg caa aac 2928 
Thr Glu Ala Arg Asp Ser Thr Val Gly Lys Gly Gly Ala Met Gin Asn 
965 970 975 

ctg ggt cag gac tec gee aca aag gtt aac tct ggc ggg caa tat acc 2976 
Leu Gly Gin Asp Ser Ala Thr Lys Val Asn Ser Gly Gly Gin Tyr Thr 
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980 985 990 

ctt ggg egg tea aaa gat gag ttt cag get ctg gee egg gca gaa gat 
Leu Gly Arg Ser Lys Asp Glu Phe Gin Ala Leu Ala Arg Ala Glu Asp 
995 1000 1005 

etc cag gtc get ggc ggt acg gca ate gtc tac gca ggt acg ctg gcg 
Leu Gin Val Ala Gly Gly Thr Ala lie Val Tyr Ala Gly Thr Leu Ala 
1010 1015 1020 

gat gca teg gtc agt ggc gcg aca gga age ctg teg tta atg acg cca 
Asp Ala Ser Val Ser Gly Ala Thr Gly Ser Leu Ser Leu Met Thr Pro 
1025 1030 1035 1040 

egg gat aat gtt acg cca gtt aaa etc gaa ggg gcg gtc egg att ace 
Arg Asp Asn Val Thr Pro Val Lys Leu Glu Gly Ala Val Arg lie Thr 
1045 1050 1055 



gac ctg acg get gee age egg ggc agt gtc tgg ctt aac age aat aat 
Asp Leu Thr Ala Ala Ser Arg Gly Ser Val Trp Leu Asn Ser Asn Asn 
1075 1080 1085 



etc aac gac ggt gat gtt tat ttg tea gca caa aca gca gcg cct gee 
Leu Asn Asp Gly Asp Val Tyr Leu Ser Ala Gin Thr Ala Ala Pro Ala 
1105 " 1110 1115 1120 

aca act aac ggt ate tac aat acg ctg aca acc aat gaa ctt tec ggt 
Thr Thr Asn Gly lie Tyr Asn Thr Leu Thr Thr Asn Glu Leu Ser Gly 
1125 1130 1135 



aaa aca ggg gga ggg gat get teg ttt acg ctg ggc aat acc ggc ggt 
Lys Thr Gly Gly Gly Asp Ala Ser Phe Thr Leu Gly Asn Thr Gly Gly 
1185 1190 1195 1200 

ttc gtt gat ctt ggg acc tat gag tat gtc ctg aaa agt gac ggc aac 
Phe Val Asp Leu Gly Thr Tyr Glu Tyr Val Leu Lys Ser Asp Gly Asn 
1205 1210 1215 

age aac tgg aac ctg acc aat gat gtc aaa ccc aac ccg gac ccc ate 
Ser Asn Trp Asn Leu Thr Asn Asp Val Lys Pro Asn Pro Asp Pro lie 



3024 



3072 



3120 



3168 



gat age gcg aca ttg act ctg gga aat ggc gtc gat acc acg ctt gee 3216 
Asp Ser Ala Thr Leu Thr Leu Gly Asn Gly Val Asp Thr Thr Leu Ala 
1060 1065 1070 



3264 



tec tgt gca ggt acc age aac tgc gaa tat aga gta aac agt ttg eta 3312 
Ser Cys Ala Gly Thr Ser Asn Cys Glu Tyr Arg Val Asn Ser Leu Leu 
1090 " 1095 1100 



3360 



3408 



age ggt aat ttc tac ctg cat acc aac gtt gca ggc tec egg ggc gat 34 56 

Ser Gly Asn Phe Tyr Leu His Thr Asn Val Ala Gly Ser Arg Gly Asp 
1140 1145 1150 

caa ctg gtc gtc aac aac aac gee act ggt aat ttt aaa ate ttt gtt 3504 
Gin Leu Val Val Asn Asn Asn Ala Thr Gly Asn Phe Lys He Phe Val 
1155 H60 1165 

cag gat acc ggc gtc age cca cag tct gac gac gcg atg acg ctg gtg . 3552 
Gin Asp Thr Gly Val Ser Pro Gin Ser Asp Asp Ala Met Thr Leu Val 
1170 " H75 1180 



3600 



3648 



3696 
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1220 



1225 



1230 



cca aat cca aag cca gac cca aaa ccc gat cca aag cca gac cca aat 
Pro Asn Pro Lys Pro Asp Pro Lys Pro Asp Pro Lys Pro Asp Pro Asn 
1235 1240 1245 



3744 



cca aaa cca gac cct act ccc gat cca acg ccg aca ccc gtt ccg gag 
Pro Lys Pro Asp Pro Thr Pro Asp Pro Thr Pro Thr Pro Val Pro Glu 
1250 " 1255 1260 

aaa cgc att acg cct tct acg gca gcc gta etc aat atg gca gca aca 
Lys Arg He Thr Pro Ser Thr Ala Ala Val Leu Asn Met Ala Ala Thr 
1265 1270 1275 1280 

tta ccg ttg gta ttt gat get gag eta aac agt att cgc gag egg ttg 
Leu Pro Leu Val Phe Asp Ala Glu Leu Asn Ser He Arg Glu Arg Leu 
1285 1290 1295 



3792 



3840 



3888 



aac ata atg aaa gcg agt cca cac aac aat aat gtc tgg ggg gcg acg 
Asn He Met Lys Ala Ser Pro His Asn Asn Asn Val Trp Gly Ala Thr 
1300 1305 1310 



3936 



tat aac acc cgt aat aat gtc acc acc gat gcg ggt gcc ggg ttt gag 3984 
Tyr Asn Thr Arg Asn Asn Val Thr Thr Asp Ala Gly Ala Gly Phe Glu 
1315 1320 1325 

cag acg ctg acc gga atg aca gtg ggg ate gac age cgt aat gat att 4032 
Gin Thr Leu Thr Gly Met Thr Val Gly He Asp Ser Arg Asn Asp He 
1330 1335 1340 



cct gaa gga att acc acg eta ggc get ttt atg ggc tat tec cat tea 
Pro Glu Gly He Thr Thr Leu Gly Ala Phe Met Gly Tyr Ser His Ser 
1345 ~ 1350 1355 1360 



4080 



cat ate ggt ttt gat cgc gga gga cat ggc agt gtg ggc agt tat tct 
His He Gly Phe Asp Arg Gly Gly His Gly Ser Val Gly Ser Tyr Ser 
1365 1370 1375 



4128 



ctg ggc ggc tat gcc agt tgg gaa cat gaa agt ggt ttc tat ctg gac 
Leu Gly Gly Tyr Ala Ser Trp Glu His Glu Ser Gly Phe Tyr Leu Asp 
1380 1385 1390 



4176 



ggt gtc gtg aag ctg aac cgt ttt aaa agt aac gta gca ggt aaa atg 
Gly Val Val Lys Leu Asn Arg Phe Lys Ser Asn Val Ala Gly Lys Met 
1395 1400 1405 



4224 



age age ggt gga gcc gcc aat ggc agt tac cac age aac ggg ctg ggc 
Ser Ser Gly Gly Ala Ala Asn Gly Ser Tyr His Ser Asn Gly Leu Gly 
1410 1415 1420 

ggt cac att gaa acc ggg atg cga ttt acc gat ggt aac tgg aac ctg 
Gly His He Glu Thr Gly Met Arg Phe Thr Asp Gly Asn Trp Asn Leu 
1425 1430 1435 1440 

acg ccg tat gca teg tta acg ggg ttc acc get gat aac ccc gaa tat 
Thr Pro Tyr Ala Ser Leu Thr Gly Phe Thr Ala Asp Asn Pro Glu Tyr 
1445 1450 1455 



4272 



4320 



4368 



cat tta tec aat ggc atg aaa teg aaa tea gtc gat acc cgc agt ata 
His Leu Ser Asn Gly Met Lys Ser Lys Ser Val Asp Thr Arg Ser He 



4416 
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1460 1465 1470 

tat cgt gaa ctg ggc gca acg ctg agt tac aac atg cgt ctg ggg aac 44 64 
Tyr Arg Glu Leu Gly Ala Thr Leu Ser Tyr Asn Met Arg Leu Gly Asn 
1475 ~ 1480 1485 

ggt atg gaa gtt gag ccg tgg ctg aag gcg get gtg cgc aaa gaa ttt 4512 
Gly Met Glu Val Glu Pro Trp Leu Lys Ala Ala Val Arg Lys Glu Phe 
1490 1495 1500 



gtc gat gat aac egg gtg aaa gtg aat agt gac ggt aat ttc gtc aat 
Val Asp Asp Asn Arg Val Lys Val Asn Ser Asp Gly Asn Phe Val Asn 
1505 1510 1515 1520 

tat ttg teg ggc aga cgt gga ata tac cag gca ggt att aaa gee tea 
Tyr Leu Ser Gly Arg Arg Gly lie Tyr Gin Ala Gly He Lys Ala Ser 
1525 1530 1535 

ttc age agt acg tta age ggg cat ctt ggg gtg ggg tat age cat agt 
Phe Ser Ser Thr Leu Ser Gly His Leu Gly Val Gly Tyr Ser His Ser 
1540 1545 . 1550 

gee ggt gtg gaa tec ccg tgg aac gcg gta get 'ggt gtg aac tgg teg 
Ala Gly Val Glu Ser Pro Trp Asn Ala Val Ala Gly Val Asn Trp Ser 
1555 1560 1565 

ttc tga 
Phe * 



<210> 142 
<211> 117 
<212> DNA 

<213> Escherichia coli 

<220> 

<221> CDS 

<222> (1) . . . (117) 

<400> 142 

atg aaa gtt cgt get tec gtc aag aaa tta tgc cgt aac tgc aaa ate 

Met Lys Val Arg Ala Ser Val Lys Lys Leu Cys Arg Asn Cys Lys He 
1 5 10 15 

gtt aag cgt gat ggt gtc ate cgt gtg att tgc agt gee gag ccg aag 
Val Lys Arg Asp Gly Val He Arg Val He Cys Ser Ala Glu Pro Lys 
20 25 30 



4560 



4608 



4656 



4704 



4710 



48 



96 



cat aaa cag cgc caa ggc tga 117 
His Lys Gin Arg Gin Gly * 

35 ■ • 



<210> 143 
<211> 1332 
<212> DNA 

<213> Escherichia coli 
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<221> CDS 

<222> (1) . . . (1332) 

<400> 143 

atg get aaa caa ccg gga tta gat ttt caa agt gec aaa ggt ggc tta 

Met Ala Lys Gin Pro Gly Leu Asp Phe Gin Ser Ala Lys Gly Gly Leu 

15 10 15 

ggc gag ctg aaa cgc aga ctg ctg ttt gtt ate ggt gcg ctg att gtg 
Gly Glu Leu Lys Arg Arg Leu Leu Phe Val He Gly Ala Leu He Val 
20 25 30 



ctt gee aaa ctg ctt gag caa cag cga ggc acc ate att gag atg ttt 

Leu Ala Lys Leu Leu Glu Gin Gin Arg Gly Thr He He Glu Met Phe 

50 55 60 

aac atg ttc tct ggt ggt get etc age cgt get tct ate ttt get ctg 

Asn Met Phe Ser Gly Gly Ala Leu Ser Arg Ala Ser He Phe Ala Leu 

65 70 75 80 

ggg ate atg ccg tat att teg gcg teg ate att ate cag ctg ctg acg 

Gly He Met Pro Tyr lie Ser Ala Ser He He He Gin Leu Leu Thr 

85 90 95 



gee ggt att gtc gcg gga etc ccg cca gee att gee cat act ate gag 

Ala Gly lie Val Ala Gly Leu Pro Pro Ala He Ala His Thr He Glu 
195 200 205 

caa gcg cgt caa ggc gac ctg cac ttc etc gtg ttg ctg ttg gtt gca 

Gin Ala Arg Gin Gly Asp Leu His Phe Leu Val Leu Leu Leu Val Ala 

210 ~ 215 220 



48 



96 



ttc cgt att ggc tct ttt att ccg ate cct ggt att gat gee get gta 14 4 

Phe Arg He Gly Ser Phe He Pro He Pro Gly He Asp Ala Ala Val 
35 40 45 



192 



240 



288 



gtg gtt cac cca acg ttg gca gaa att aag aaa gaa ggg gag tct ggt 336 
Val Val His Pro Thr Leu Ala Glu He Lys Lys Glu Gly Glu Ser Gly 
100 105 HO 

cgt cgt aag ate age cag tac acc cgc tac ggt act ctg gtg ctg gca 
Arg Arg Lys He Ser Gin Tyr Thr Arg Tyr Gly Thr Leu Val Leu Ala 
115 120 125 



384 



ata ttc cag teg ate ggt att get acc ggt ctg ccg aat atg cct ggt 432 
He Phe Gin Ser He Gly He Ala Thr Gly Leu Pro Asn Met Pro Gly 
130 135 140 

atg caa ggc ctg gtg att aac ccg ggc ttt gca ttc tac ttc acc get 
Met Gin Gly Leu Val He Asn Pro Gly Phe Ala Phe Tyr Phe Thr Ala 
145 150 155 160 

gtt gta agt ctg gtc aca gga acc atg ttc ctg atg tgg ttg ggc gaa 
Val Val Ser Leu Val Thr Gly Thr Met Phe Leu Met Trp Leu Gly Glu 
165 170 175 



480 



528 



cag att act gaa cga ggt ate ggc aac ggt att tea ate att ate ttc 576 
Gin He Thr Glu Arg Gly He Gly Asn Gly He Ser He He lie Phe 
180 185 190 



624 



672 
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gta tta gta ttt gca gtg acg ttc ttt gtt gta ttt gtt gag cgt ggt 
Val Leu Val Phe Ala Val Thr Phe Phe Val Val Phe Val Glu Arg Gly 
225 230 235 240 

caa cgc cgc att gtg gta aac tac gcg aaa cgt cag caa ggt cgt cgt 
Gin Arq Arg He Val Val Asn Tyr Ala Lys Arg Gin Gin Gly Arg Arg 
245 250 255 

gtc tat get gca cag age aca cat tta ccg ctg aaa gtg aat atg gcg 
Val Tyr Ala Ala Gin Ser Thr His Leu Pro Leu Lys Val Asn Met Ala 
260 265 270 

ggg gta ate ccg gca ate ttc get tec agt att att ctg ttc ccg gcg 
Gly Val He Pro Ala He Phe Ala Ser Ser He lie Leu Phe Pro Ala 
275 280 285 

ace ate gcg tea tgg ttc ggg ggc ggt act ggt tgg aac tgg ctg aca 
Thr lie Ala Ser Trp Phe Gly Gly Gly Thr Gly Trp Asn Trp Leu Thr 
290 295 300 

aca att teg ctg tat ttg cag cct ggg caa ccg ctt tat gtg tta etc 
Thr He Ser Leu Tyr Leu Gin Pro Gly Gin Pro Leu Tyr Val Leu Leu 
305 310 315 320 

tat gcg tct gca ate ate ttc ttc tgt ttc ttc tac acg gcg ttg gtt 
Tyr Ala Ser Ala He He Phe Phe Cys Phe Phe Tyr Thr Ala Leu Val 
325 330 335 

ttc aac ccg cgt gaa aca gca gat aac ctg aag aag tec ggt gca ttt 
Phe Asn Pro Arg Glu Thr Ala Asp Asn Leu Lys Lys Ser Gly Ala Phe 
340 345 350 

gta cca gga att cgt ccg gga gag caa acg gcg aag tat ate gat aaa 
Val Pro Gly He Arg Pro Gly Glu Gin Thr Ala Lys Tyr lie Asp Lys 
355 360 365 



720 



768 



816 



864 



912 



960 



1008 



1056 



1104 



gta atg ace cgc ctg ace ctg gtt ggt gcg ctg tat att ace ttt ate 1152 
Val Met Thr Arg Leu Thr Leu Val Gly Ala Leu Tyr He Thr Phe He 
370 375 380 

tgc ctg ate ccg gag ttc atg cgt gat gca atg aaa gta ccg ttc tac 
Cys Leu He Pro Glu Phe Met Arg Asp Ala Met Lys Val Pro Phe Tyr 
385 390 395 400 

ttc ggt ggg ace tea ctg ctt ate gtt gtt gtc gtg att atg gac ttt 
Phe Gly Gly Thr Ser Leu Leu lie Val Val Val Val He Met Asp Phe 
405 410 415 

atg get caa gtg caa act ctg atg atg tec agt cag tat gag tct gca 
Met Ala Gin Val Gin Thr Leu Met Met Ser Ser Gin Tyr Glu Ser Ala 
420 425 430 

ttg aag aag gcg aac ctg aaa ggc tac ggc cga taa . 1332 

Leu Lys Lys Ala Asn Leu Lys Gly Tyr Gly Arg * 
435 440 



<210> 144 
<211> 435 
<212> DNA 



1200 



1248 



1296 
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<213> Escherichia coli 

<220> 

<221> CDS 

<222> (1) . . . (435) 

<400> 144 

atg cgt tta aat act ctg tct ccg gcc gaa ggc tec aaa aag gcg ggt 

Met Arg Leu Asn Thr Leu Ser Pro Ala Glu Gly Ser Lys Lys Ala Gly 
1 5 10 15 

aaa cgc ctg ggt cgt ggt ate ggt tct ggc etc ggt aaa acc ggt ggt 
Lys Arg Leu Gly Arg Gly He Gly Ser Gly Leu Gly Lys Thr Gly Gly 
20 25 30 



ggt ttc gag ggt ggt cag atg cct ctg tac cgt cgt ctg ccg aaa ttc 
Gly Phe Glu Gly Gly Gin Met Pro Leu Tyr Arg Arg Leu Pro Lys Phe 
50 55 60 



ggc 



gac ctg get aaa gta gaa ggc ggt gta gta gac ctg aac acg ctg aaa 
Asp Leu Ala Lys Val Glu Gly Gly Val Val Asp Leu Asn Thr Leu Lys 
85 90 95 



get ggc gaa gta acg act ccg gta act gtt cgt ggc ctg cgt gtt act 

Ala Gly Glu Val Thr Thr Pro Val Thr Val Arg Gly Leu Arg Val Thr 

115 120 125 

aaa ggc get cgt get get ate gaa get get ggc ggt aaa ate gag gaa 

Lys Gly Ala Arg Ala Ala He Glu Ala Ala Gly Gly Lys He Glu Glu 

130 135 140 

taa 



48 



96 



cgt ggt cac aaa ggt cag aag tct cgt tct ggc ggt ggc gta cgt cgc 144 
Arg Gly His Lys Gly Gin Lys Ser Arg Ser Gly Gly Gly Val Arg Arg 
35 4 0 45 



192 



ttc act tct cgt aaa gca gcg att aca gcc gaa att cgt ctg tct 2 40 

Gly Phe Thr Ser Arg Lys Ala Ala He Thr Ala Glu He Arg Leu Ser 
65 70 75 80 



288 



gcg get aac att ate ggt ate cag ate gag ttc gcg aaa gtg ate ctg 336 
Ala Ala Asn He He Gly He Gin He Glu Phe Ala Lys Val He Leu 
100 105 HO 



384 



4 32 



435 



<210> 145 
<211> 180 
<212> DNA 

<213> Escherichia coli 

<220> 

<221> CDS 

<222> (1) . . . (180) 

<400> 145 

atg gca aag act att aaa att act caa acc cgc agt gca ate ggt cgt 
Met Ala Lys Thr lie Lys He Thr Gin Thr Arg Ser Ala lie Gly Arg 
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10 15 



aac gcg gtt tec ttc atg gtt aaa gtt gag gag taa 
Asn Ala Val Ser Phe Met Val Lys Val Glu Glu * 
50 55 



<210> 146 
<211> 504 
<212> DNA 

<213> Escherichia coli 

<220> 

<221> CDS 

<222> (1) - (504) 

<400> 146 

atg get cac ate gaa aaa caa get ggc gaa ctg cag gaa aag ctg ate 
Met Ala His He Glu Lys Gin Ala Gly Glu Leu Gin Glu Lys Leu He 
1 5 10 15 

gcg gta aac cgc gta tct aaa acc gtt aaa ggt ggt cgt att ttc tec 
Ala Val Asn Arg Val Ser Lys Thr Val Lys Gly Gly Arg He Phe Ser 
20 25 30 



gaa aaa gee cgt cgc aat atg att aac gtc gcg ctg aat aac ggc act 
Glu Lys Ala Arg Arg Asn Met lie Asn Val Ala Leu Asn Asn Gly Thr 
65 10 75 80 

ctg caa cac cct gtt aaa ggt gtt cac acg ggt tct cgc gta ttc atg 
Leu Gin His Pro Val Lys Gly Val His Thr Gly Ser Arg Val Phe Met 
85 90 95 

cag ccg get tec gaa ggt acc ggt ate ate gee ggt ggt gca atg cgc 
Gin Pro Ala Ser Glu Gly Thr Gly He He Ala Gly Gly Ala Met Arg 
100 105 HO 

gec gtt ctg gaa gtc get ggg gtt cat aac gtt ctg get aaa gec tat 
Ala Val Leu Glu Val Ala Gly Val His Asn Val Leu Ala Lys Ala Tyr 
115 120 125 

ggt tec acc aac ccg ate aac gtg gtt cgt gca act att gat ggc ctg 
Gly Ser Thr Asn Pro He Asn Val Val Arg Ala Thr He Asp Gly Leu 
130 135 140 



96 



ctg ccg aaa cac aag gca acg ctg ctt ggc ctg ggt ctg cgt cgt att 
Leu Pro Lys His Lys Ala Thr Leu Leu Gly Leu Gly Leu Arg Arg He 
20 25 30 

ggt cac acc gta gag cgc gag gat act cct get att cgc ggt atg ate 14 4 

Gly His Thr Val Glu Arg Glu Asp Thr Pro Ala He Arg Gly Met He 
35 40 45 



180 



48 



96 



ttc aca get ctg act gta gtt ggc gat ggt aac ggt cgc gtt ggt ttt 144 

Phe Thr Ala Leu Thr Val Val Gly Asp Gly Asn Gly Arg Val Gly Phe 

35 40 45 

ggt tac ggt aaa gcg cgt gaa gtt cca gca gcg ate cag aaa gcg atg 

Gly Tyr Gly Lys Ala Arg Glu Val Pro Ala Ala He Gin Lys Ala Met 
50 " 55 60 



192 



240 



288 



336 



384 



432 
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gaa aat atg aat tct cca gaa atg gtc get gec aag cgt ggt aaa tec 480 
Glu Asn Met Asn Ser Pro Glu Met Val Ala Ala Lys Arg Gly Lys Ser 
145 150 155 160 



gtt gaa gaa att ctg ggg aaa taa 
Val Glu Glu lie Leu Gly Lys * 
165 



504 



<210> 147 
<211> 354 
<212> DNA 

<213> Escherichia coli 




















<220> 
<221> CDS 
<222> (1)... 


(354) 






















<400> 147 
atg gat aag 
Met Asp Lys 
1 


aaa 
Lys 


tct 
Ser 
5 


get 
Ala 


cgt 
Arg 


ate cgt cgt 
He Arg Arg 
10 


gcg acc cgc 
Ala Thr Arg 


gca cgc 
Ala Arg 
15 


cgc 
Arg 


aag etc cag 
Lys Leu Gin 


gag 
Glu 
20 


ctg 
Leu 


99 c 
Gly 


gca 


act cgc 
Thr Arg 
25 


ctg 
Leu 


gtg gta 
Val Val 


cat 
His 


cgt 
Arg 
30 


acc 
Thr 


ccg 
Pro 


cgt cac att 
Arg His lie 
35 


tac 
Tyr 


gca 
Ala 


cag 
Gin 


gta 
Val 


att 
He 
40 


gca 
Ala 


ccg 
Pro 


aac ggt 
Asn Gly 


tct 
Ser 
45 


gaa 
Glu 


gtt 
Val 


ctg 
Leu 


gta get get 
Val Ala Ala 
50 


tct 
Ser 


act 
Thr 


gta 
Val 


gaa 
Glu 
55 


aaa 
Lys 


get 
Ala 


ate 
He 


get gaa 
Ala Glu 
60 


caa 
Gin 


ctg 
Leu 


aag 
Lys 


tac 
Tyr 


acc ggt aac 
Thr Gly Asn 
65 


aaa 
Lys 


gac 
Asp 


gcg 
Ala 
70 


get 
Ala 


gca 
Ala 


get 
Ala 


gtg 
Val 


ggt aaa get 
Gly Lys Ala 
75 


gtc 
Val 


get 
Ala 


gaa 
Glu 
80 


cgc get ctg 
Arg Ala Leu 


gaa 
Glu 


aaa 
Lys 
85 


ggc 
Gly 


ate 
He 


aaa 
Lys 


gat 
Asp 


gta 
Val 
90 


tec ttt 
Ser Phe 


gac 
Asp 


cgt 
Arg 


tec 
Ser 
95 


ggg 

Gly 


ttc caa tat 
Phe Gin Tyr 


cat 
His 
100 


ggt 
Gly 


cgt 
Arg 


gtc 
Val 


cag 
Gin 


gca 
Ala 
105 


ctg 
Leu 


gca gat get 
Ala Asp Ala 


gee cgt gaa 
Ala Arg Glu 
110 



get ggc ctt cag ttc taa 
Ala Gly Leu Gin Phe * 
115 



48 



96 



144 



192 



240 



288 



336 



354 



<210> 148 
<211> 534 
<212> DNA 

<213> Escherichia coli 

<220> 
<221> CDS 
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<222> (1)...(534) 
<400> 148 

atg tct cgt gtt get aaa gca ccg gtc gtt gtt cct gec ggc gtt gac 
Met Ser Arg Val Ala Lys Ala Pro Val Val Val Pro Ala Gly Val Asp 
15 10 15 

gta aaa ate aac ggt cag gtt att acg ate aaa ggt aaa aac ggc gag 
Val Lys He Asn Gly Gin Val He Thr He Lys Gly Lys Asn Gly Glu 
20 25 30 

ctg act cgt act etc aac gat get gtt gaa gtt aaa cat gca gat aat 
Leu Thr Arg Thr Leu Asn Asp Ala Val Glu Val Lys His Ala Asp Asn 
35 40 45 

ace ctg ace ttc ggt ccg cgt gat ggt tac gca gac ggt tgg gca cag 
Thr Leu Thr Phe Gly Pro Arg Asp Gly Tyr Ala Asp Gly Trp Ala Gin 
50 55 60 

get ggt acc gcg cgt gec ctg ctg aac tea atg gtt ate ggt gtt ace 
Ala Gly Thr Ala Arg Ala Leu Leu Asn Ser Met Val He Gly Val Thr 
65 70 75 80 

gaa ggc ttc act aag aag ctg cag ctg gtt ggt gta ggt tac cgt gca 
Glu Gly Phe Thr Lys Lys Leu Gin Leu Val Gly Val Gly Tyr Arg Ala 
85 90 95 

gcg gtt aaa ggc aat gtg att aac ctg tct ctg ggt ttc tct cat cct 
Ala Val Lys Gly Asn Val He Asn Leu Ser Leu Gly Phe Ser His Pro 
100 105 HO 

gtt gac cat cag ctg cct gcg ggt ate act get gaa tgt ccg act cag 
Val Asp His Gin Leu Pro Ala Gly He Thr Ala Glu Cys Pro Thr Gin 
115 120 125 

act gaa ate gtg ctg aaa ggc get gat aag cag gtg ate ggc cag gtt 
Thr Glu He Val Leu Lys Gly Ala Asp Lys Gin Val He Gly Gin Val 
130 135 140 

gca gcg gat ctg cgc gec tac cgt cgt cct gag cct tat aaa ggc aag 
Ala Ala Asp Leu Arg Ala Tyr Arg Arg Pro Glu Pro Tyr Lys Gly Lys 
145 150 155 160 

ggt gtt cgt tac gee gac gaa gtc gtg cgt acc aaa gag get aag aag 
Gly Val Arg Tyr Ala Asp Glu Val Val Arg Thr Lys Glu Ala Lys Lys 
165 170 175 



48 



96 



144 



192 



240 



288 



336 



384 



432 



480 



528 



aag taa 534 
Lys 



+ 



<210> 149 
<211> 393 
<212> DNA 

<213> Escherichia coli 

<220> 

<221> CDS 

<222> (1) . . . (393) 
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<400> 149 

atg age atg caa gat ccg ate gcg gat atg ctg acc cgt ate cgt aac 4 8 

Met Ser Met Gin Asp Pro He Ala Asp Met Leu Thr Arg He Arg Asn 
15 10 15 

ggt cag gec gcg aac aaa get gcg gtc acc atg cct tec tec aag ctg 96 
Gly Gin Ala Ala Asn Lys Ala Ala Val Thr Met Pro Ser Ser Lys Leu 
20 25 30 

aaa gtg gca ate gee aac gtg ctg aag gaa gaa ggt ttt att gaa gat 144 
Lys Val Ala He Ala Asn Val Leu Lys Glu Glu Gly Phe He Glu Asp 
35 40 45 

ttt aaa gtt gaa ggc gac acc aag cct gaa ctg gaa ctt act ctg aag 192 
Phe Lys Val Glu Gly Asp Thr Lys Pro Glu Leu Glu Leu Thr Leu Lys 
50 55 60 

tat ttc cag ggc aaa get gtt gta gaa age att cag cgt gtc age cgc 
Tyr Phe Gin Gly Lys Ala Val Val Glu Ser He Gin Arg Val Ser Arg 
65 70 75 80 

cca ggt ctg cgc ate tat aaa cgt aaa gat gag ctg ccg aaa gtt atg 
Pro Gly Leu Arg He Tyr Lys Arg Lys Asp Glu Leu Pro Lys Val Met 
85 90 95 

gcg ggt ctg ggt ate gca gtt gtt tct acc tct aaa ggt gtt atg act 
Ala Gly Leu Gly He Ala Val Val Ser Thr Ser Lys Gly Val Met Thr 
100 105 HO 

gat cgt gca gcg cgc cag get ggt ctt ggt ggc gaa att ate tgc tac 
Asp Arg Ala Ala Arg Gin Ala Gly Leu Gly Gly Glu He He Cys Tyr 
115 120 125 

393 

gta gee taa 
Val Ala * 
130 



240 



288 



336 



384 



<210> 150 
<211> 306 
<212> DNA 

<213> Escherichia coli 

<220> 

<221> CDS 

<222> (1) . . . (306) 

<400> 150 

atg get aag caa tea atg aaa gca cgc gaa gta aaa cgc gta get tta 
Met Ala Lys Gin Ser Met Lys Ala Arg Glu Val Lys Arg Val Ala Leu 
15 10 15 

get gat aaa tac ttc gcg aaa cgc get gaa ctg aaa gcg ate ate tct 
Ala Asp Lys Tyr Phe Ala Lys Arg Ala Glu Leu Lys Ala He He Ser 
20 25 30 

gat gtg aac get tec gac gaa gat cgt tgg aac get gtt etc aag ctg 
Asp Val Asn Ala Ser Asp Glu Asp Arg Trp Asn Ala Val Leu Lys Leu 
35 40 45 



48 



96 



144 
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cag act ctg ccg cgt gat tec age ccg tct cgt cag cgt aac cgc tgc 192 
Gin Thr Leu Pro Arg Asp Ser Ser Pro Ser Arg Gin Arg Asn Arg Cys 
50 ' 55 60 

cgt caa aca ggt cgt ccg cat ggt ttc ctg egg aag ttc ggg ttg age 240 
Arg Gin Thr Gly Arg Pro His Gly Phe Leu Arg Lys Phe Gly Leu Ser 
65 70 75 80 



cgt att aag gtc cgt gaa gee get atg cgc ggt gaa ate ccg ggt ctg 
Arg lie Lys Val Arg Glu Ala Ala Met Arg Gly Glu lie Pro Gly Leu 
85 90 95 



ctg ctg gat aac gca gca gca gac ctg gca gca ate tec ggt caa aaa 
Leu Leu Asp Asn Ala Ala Ala Asp Leu Ala Ala lie Ser Gly Gin Lys 
50 55 60 



288 



aaa aag get age tgg taa 306 
Lys Lys Ala Ser Trp * 
100 



<210> 151 
<211> 540 
<212> DNA 

<213> Escherichia coli 

<220> 

<221> CDS 

<222> {!)... (540) 

<400> 151 

atg gcg aaa ctg cat gat tac tac aaa gac gaa gta gtt aaa aaa etc 48 

Met Ala Lys Leu His Asp Tyr Tyr Lys Asp Glu Val Val Lys Lys Leu 
1 5 10 15 

atg act gag ttt aac tac aat tct gtc atg caa gtc cct egg gtc gag 96 
Met Thr Glu Phe Asn Tyr Asn Ser Val Met Gin Val Pro Arg Val Glu 
20 25 30 

aag ate acc ctg aac atg ggt gtt ggt gaa gcg ate get gac aaa aaa 144 
Lys lie Thr Leu Asn Met Gly Val Gly Glu Ala He Ala Asp Lys Lys 
35 40 45 



192 



ccg ctg ate acc aaa gca cgc aaa tct gtt gca ggc ttc aaa ate cgt 240 
Pro Leu He Thr Lys Ala Arg Lys Ser Val Ala Gly Phe Lys He Arg 
65 70 75 80 

cag ggc tat ccg ate ggc tgt aaa gta act ctg cgt ggc gaa cgc atg 288 
Gin Gly Tyr Pro He Gly Cys Lys Val Thr Leu Arg Gly Glu Arg Met 
85 90 95 

tgg gag ttc ttt gag cgc ctg ate act att get gta cct cgt ate cgt 336 
Trp Glu Phe Phe Glu Arg Leu lie Thr He Ala Val Pro Arg He Arg. 
100 .105 110 

gac ttc cgt ggc ctg tec get aag tct ttc gac ggt cgt ggt aac tac 384 
Asp Phe Arg Gly Leu Ser Ala Lys Ser Phe Asp Gly Arg Gly Asn Tyr 
115 120 125 
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age atg ggt gtc cgt gag cag ate ate 
Ser Met Gly Val Arg Glu Gin lie lie 
130 135 

aaa gtc gac cgc gtt cgt ggt ttg gat 
Lys Val Asp Arg Val Arg Gly Leu Asp 
145 150 

aaa tct gac gaa gaa ggc cgc get ctg 
Lys Ser Asp Glu Glu Gly Arg Ala Leu 
165 

ttc cgc aag taa 
Phe Arg Lys * 



ttc cca gaa ate gac tac gat 4 32 

Phe Pro Glu lie Asp Tyr Asp 
140 

att acc att acc act act gcg 480 

He Thr He Thr Thr Thr Ala 
155 160 

ctg get gee ttt gac ttc ccg 528 
Leu Ala Ala Phe Asp Phe Pro 
170 175 

540 



<210> 152 
<211> 315 
<212> DNA 

<213> Escherichia coli 

<220> 

<221> CDS 

<222> (1) . . . (315) 

<400> 152 

atg gca gcg aaa ate cgt cgt gat gac gaa gtt ate gtg tta acc ggt 48 

Met Ala Ala Lys He Arg Arg Asp Asp Glu Val He Val Leu Thr Gly 

15 10 15 

aaa gat aaa ggt aaa cgc ggt aaa gtt aag aat gtc ctg tct tec ggc 96 
Lys Asp Lys Gly Lys Arg Gly Lys Val Lys Asn Val Leu Ser Ser Gly 
20 " 25 30 

aag gtc att gtt gaa ggt ate aac ctg gtt aag aaa cat cag aag ccg 144 
Lys Val He Val Glu Gly He Asn Leu Val Lys Lys His Gin Lys Pro 
35 40 45 

gtt ccg gee ctg aac caa ccg ggt ggc ate gtt gaa aaa gaa gee get 192 
Val Pro Ala Leu Asn Gin Pro Gly Gly He Val Glu Lys Glu Ala Ala 
50 55 60 

att cag gtt tec aac gta gca ate ttc aat gcg gca acc ggc aag get 240 
He Gin Val Ser Asn Val Ala He Phe Asn Ala Ala Thr Gly Lys Ala 
65 70 75 80 

gac cgt gta ggc ttt aga ttc gaa gac ggt aaa aaa gtc cgt ttc ttc 288 
Asp Arg Val Gly Phe Arg Phe Glu Asp Gly Lys Lys Val Arg Phe Phe 
85 , 90 95 

aag tct aac age gaa act ate aag taa 315 
Lys Ser Asn Ser Glu Thr He Lys * 
100 



<210> 153 
<211> 372 
<212> DNA 

<213> Escherichia coli 
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<220> 

<221> CDS 

<222> (1) . . . (372) 

<400> 153 

atg ate caa gaa cag act atg ctg aac gtc gec gac aac tec ggt gca 48 
Met lie Gin Glu Gin Thr Met Leu Asn Val Ala Asp Asn Ser Gly Ala 
1 5 10 15 

cgt cgc gta atg tgt ate aag gtt ctg ggt ggc teg cac cgt cgc tac 96 
Arg Arg Val Met Cys He Lys Val Leu Gly Gly Ser His Arg Arg Tyr 
20 25 30 

gca ggc gta ggc gac ate ate aag ate acc ate aaa gaa gca att ccg 144 
Ala Gly Val Gly Asp He He Lys He Thr He Lys Glu Ala He Pro 
35 40 45 

cgt ggt aag gtc aaa aaa ggt gat gtg ctg aag gcg gta gtg gtg cgc 192 
Arg Gly Lys Val Lys Lys Gly Asp Val Leu Lys Ala Val Val Val Arg 
50 55 60 

acc aag aag ggt gtt cgt cgc ccg gac ggt tct gtc att cgc ttc gat 240 
Thr Lys Lys Gly Val Arg Arg Pro Asp Gly Ser Val He Arg Phe Asp 
65 70 7 5 80 

ggt aat get tgt gtt ctt ctg aac aac aac age gag cag cct ate ggt 288 
Gly Asn Ala Cys Val Leu Leu Asn Asn Asn Ser Glu Gin Pro He Gly 
85 90 95 

acg cgt att ttt ggg ccg gta act cgt gag ctt cgt agt gag aag ttc 336 
Thr Arg He Phe Gly Pro Val Thr Arg Glu Leu Arg Ser Glu Lys Phe 
100 105 110 

atg aaa att ate tct ctg gca cca gaa gta etc taa 372 
Met Lys He He Ser Leu Ala Pro Glu Val Leu * 
115 120 



<210> 154 
<211> 1191 
<212> DNA 

<213> Escherichia coli 

<220> 

<221> CDS 

<222> (1) . . . (1191) 

<400> 154 

atg aaa ata aaa aca ggt gca cgc ate etc gca tta tec gca tta acg 48 
Met Lys He Lys Thr Gly Ala Arg He Leu Ala Leu Ser Ala Leu Thr 
15 10 15 

acg atg atg ttt tec gee teg get etc gee aaa ate gaa gaa ggt aaa - 96 

Thr Met Met Phe Ser Ala Ser Ala Leu Ala Lys He Glu Glu Gly Lys 
20 25 30 

ctg gta ate tgg att aac ggc gat aaa ggc tat aac ggt etc get gaa 144 
Leu Val He Trp He Asn Gly Asp Lys Gly Tyr Asn Gly Leu Ala Glu 
35 40 45 
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gtc ggt aag aaa ttc gag aaa gat acc gga att aaa gtc acc gtt gag 192 
Val Gly Lys Lys Phe Glu Lys Asp Thr Gly He Lys Val Thr Val Glu 
50 55 60 

cat ccg gat aaa ctg gaa gag aaa ttc cca cag gtt gcg gca act ggc 240 
His Pro Asp Lys Leu Glu Glu Lys Phe Pro Gin Val Ala Ala Thr Gly 
65 * 70 75 80 

gat ggc cct gac att ate ttc tgg gca cac gac cgc ttt ggt ggc tac 288 
Asp Gly Pro Asp lie He Phe Trp Ala His Asp Arg Phe Gly Gly Tyr 
85 90 95 

get caa tct ggc ctg ttg get gaa ate acc ccg gac aaa gcg ttc cag 336 
Ala Gin Ser Gly Leu Leu Ala Glu He Thr Pro Asp Lys Ala Phe Gin 
100 105 110 

gac aag ctg tat ccg ttt acc tgg gat gec gta cgt tac aac ggc aag 384 
Asp Lys Leu Tyr Pro Phe Thr Trp Asp Ala Val Arg Tyr Asn Gly Lys 
115 ^ 120 125 

ctg att get tac ccg ate get gtt gaa gcg tta teg ctg att tat aac 4 32 

Leu He Ala Tyr Pro He Ala Val Glu Ala Leu Ser Leu He Tyr Asn 
130 " 135 140 

aaa gat ctg ctg ccg aac ccg cca aaa acc tgg gaa gag ate ccg gcg 4 80 

Lys Asp Leu Leu Pro Asn Pro Pro Lys Thr Trp Glu Glu He Pro Ala 
145 150 155 160 

ctg gat aaa gaa ctg aaa gcg aaa ggt aag age gcg ctg atg ttc aac 528 
Leu Asp Lys Glu Leu Lys Ala Lys Gly Lys Ser Ala Leu Met Phe Asn 
165 170 175 

ctg caa gaa ccg tac ttc acc tgg ccg ctg att get get gac ggg ggt 57 6 

Leu Gin Glu Pro Tyr Phe Thr Trp Pro Leu He Ala Ala Asp Gly Gly 
180 185 190 

tat gcg ttc aag tat gaa aac ggc aag tac gac att aaa gac gtg ggc 624 
Tyr Ala Phe Lys Tyr Glu Asn Gly Lys Tyr Asp lie Lys Asp Val Gly 
195 200 205 

gtg gat aac get ggc gcg aaa gcg ggt ctg acc ttc ctg gtt gac ctg 672 
Val Asp Asn Ala Gly Ala Lys Ala Gly Leu Thr Phe Leu Val Asp Leu 
210 215 220 

att aaa aac aaa cac atg aat gca gac acc gat tac tec ate gca gaa 720 
He Lys Asn Lys His Met Asn Ala Asp Thr Asp Tyr Ser He Ala Glu 
225 J 230 235 240 

get gee ttt aat aaa ggc gaa aca gcg atg acc ate aac ggc ccg tgg 768 
Ala Ala Phe Asn Lys Gly Glu Thr Ala Met Thr He Asn Gly Pro Trp 
245 250 255 

gca tgg tec aac ate gac acc age aaa gtg aat tat ggt gta acg gta 816 
Ala Trp Ser Asn He Asp Thr Ser Lys Val Asn Tyr Gly Val Thr Val 
260 265 270 

ctg ccg acc ttc aag ggt caa cca tec aaa ccg ttc gtt ggc gtg ctg 8 64 

Leu Pro Thr Phe Lys Gly Gin Pro Ser Lys Pro Phe Val Gly Val Leu 
275 280 285 
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age gca ggt att aac gec gec agt ccg aac aaa gag ctg gcg aaa gag 912 
Ser Ala Gly He Asn Ala Ala Ser Pro Asn Lys Glu Leu Ala Lys Glu 
290 295 300 

ttc etc gaa aac tat ctg ctg act gat gaa ggt ctg gaa gcg gtt aat 960 
Phe Leu Glu Asn Tyr Leu Leu Thr Asp Glu Gly Leu Glu Ala Val Asn 
305 310 315 320 

aaa gac aaa ccg ctg ggt gec gta gcg ctg aag tct tac gag gaa gag 1008 
Lys Asp Lys Pro Leu Gly Ala Val Ala Leu Lys Ser Tyr Glu Glu Glu 
325 330 335 

ttg gcg aaa gat cca cgt att gec gec ace atg gaa aac gec cag aaa 1056 
Leu Ala Lys Asp Pro Arg lie Ala Ala Thr Met Glu Asn Ala Gin Lys 
340 345 350 

ggt gaa ate atg ccg aac ate ccg cag atg tec get ttc tgg tat gee 1104 
Gly Glu He Met Pro Asn He Pro Gin Met Ser Ala Phe Trp Tyr Ala 
355 360 365 

gtg cgt act gcg gtg ate aac gee gee age ggt cgt cag act gtc gat 1152 
Val Arg Thr Ala Val He Asn Ala Ala Ser Gly Arg Gin Thr Val Asp 
370 375 380 

gaa gec ctg aaa gac gcg cag act cgt ate acc aag taa 1191 
Glu Ala Leu Lys Asp Ala Gin Thr Arg He Thr Lys * 
385 390 395 



<210> 155 
<211> 1545 
<212> DNA 

<213> Escherichia coli 

<220> 

<221> CDS 

<222> (1) . . . (1545) 

<400> 155 

atg gat gtc att aaa aag aaa cat tgg tgg caa age gac gcg ctg aaa 

Met Asp Val He Lys Lys Lys His Trp Trp Gin Ser Asp Ala Leu Lys 

1 5 10 15 



48 



tgg tea gtg eta ggt ctg etc ggc ctg ctg gtg ggt tac ctt gtt gtt 96 
Trp Ser Val Leu Gly Leu Leu Gly Leu Leu Val Gly Tyr Leu Val Val 
20 25 30 

tta atg tac gca caa ggg gaa tac ctg ttc gec att acc acg ctg ata 144 
Leu Met Tyr Ala Gin Gly Glu Tyr Leu Phe Ala He Thr Thr Leu He 
35 40 45 

ttg agt tea gcg ggg ctg tat att ttc gee aat cgt aaa gee tac gee 192 
Leu Ser Ser Ala Gly Leu Tyr He Phe Ala Asn Arg Lys Ala Tyr Ala 
50 55 60 

tgg cgc tat gtt tac ccg gga atg get gga atg gga tta ttc gtc etc 240 
Trp Arg Tyr Val Tyr Pro Gly Met Ala Gly Met Gly Leu Phe Val Leu 
65 70 75 80 
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ttc cct ctg gtc tgc acc ate gec att gec ttc acc aac tac age age 288 
Phe Pro Leu Val Cys Thr He Ala He Ala Phe Thr Asn Tyr Ser Ser 
85 90 95 

act aac cag ctg act ttt gaa cgt gcg cag gaa gtg ttg tta gat cgc 336 
Thr Asn Gin Leu Thr Phe Glu Arg Ala Gin Glu Val Leu Leu Asp Arg 
100 105 110 

tec tgg caa gca ggc aaa acc tat aac ttt ggt ctt tac ccg gcg ggc 384 
Ser Trp Gin Ala Gly Lys Thr Tyr Asn Phe Gly Leu Tyr Pro Ala Gly 
115 120 125 

gat gag tgg caa ctg gcg etc age gac ggc gaa acc ggc aaa aat tac 432 
Asp Glu Trp Gin Leu Ala Leu Ser Asp Gly Glu Thr Gly Lys Asn Tyr 
130 135 140 

etc tec gac get ttt aaa ttt ggc ggc gag caa aaa ctg caa ctg aaa 4 80 

Leu Ser Asp Ala Phe Lys Phe Gly Gly Glu Gin Lys Leu Gin Leu Lys 
145 150 155 160 

gaa acg acc gec cag ccc gaa ggc gaa cgc gcg aat ctg cgc gtg att 528 
Glu Thr Thr Ala Gin Pro Glu Gly Glu Arg Ala Asn Leu Arg Val He 
165 170 175 

acc cag aat cgt cag gcg ctg agt gac att acc gee att ctg ccg gat 57 6 

Thr Gin Asn Arg Gin Ala Leu Ser Asp He Thr Ala He Leu Pro Asp 
180 185 190 

ggc aac aaa gtg atg atg age tec ctg cgc cag ttt tct ggc acg cag 624 
Gly Asn Lys Val Met Met Ser Ser Leu Arg Gin Phe Ser Gly Thr Gin 
195 200 205 

ccg etc tac aca etc gac ggt gac ggc acg ttg acg aat aat cag age 672 
Pro Leu Tyr Thr Leu Asp Gly Asp Gly Thr Leu Thr Asn Asn Gin Ser 
210 215 220 

ggc gtg aaa tat cgt ccg aat aac caa att ggc ttt tac cag tec att 720 
Gly Val Lys Tyr Arg Pro Asn Asn Gin He Gly Phe Tyr Gin Ser He 
225 230 235 240 

acc gee gac ggc aac tgg ggt gat gaa aag eta age ccc ggt tac acc 768 
Thr Ala Asp Gly Asn Trp Gly Asp Glu Lys Leu Ser Pro Gly Tyr Thr 
245 250 255 

gtg acc acc ggc tgg aaa aac ttt acc cgc gtc ttt acc gac gaa ggc 816 
Val Thr Thr Gly Trp Lys Asn Phe Thr Arg Val Phe Thr Asp Glu Gly 
260 265 270 

att cag aaa ccg ttc etc gee att ttc gtc tgg acc gtg gtg ttc teg 8 64 

He Gin Lys Pro Phe Leu Ala He Phe Val Trp Thr Val Val Phe Ser 
275 280 285 

ctg ate act gtc ttt tta acg gtg gcg gtc ggc atg gtt ctg gcg tgt 912 
Leu He Thr Val Phe Leu Thr Val Ala Val Gly Met Val Leu Ala Cys 
290 295 300 

ctg gtg cag tgg gaa gcg ttg cgc ggc aaa gcg gtc tat cgc gtc ctg 960 
Leu Val Gin Trp Glu Ala Leu Arg Gly Lys Ala Val Tyr Arg Val Leu 
305 310 315 320 
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ctg att ctg ccc tac gcg gtg cca teg ttc att tea ate ttg att ttc 1008 

Leu lie Leu Pro Tyr Ala Val Pro Ser Phe lie Ser lie Leu lie Phe 

325 330 335 

aaa ggg ttg ttt aac cag age ttc ggt gaa ate aac atg atg ttg age 1056 

Lys Gly Leu Phe Asn Gin Ser Phe Gly Glu lie Asn Met Met Leu Ser 

340 345 350 



gcg ctg ttt ggc gtg aag ccc gee tgg ttc age gat ccg acc acc gee 
Ala Leu Phe Gly Val Lys Pro Ala Trp Phe Ser Asp Pro Thr Thr Ala 
355 360 365 



1104 



cgc acg atg eta att ate gtc aat acc tgg ctg ggt tat ccg tac atg 1152 
Arg Thr Met Leu lie lie Val Asn Thr Trp Leu Gly Tyr Pro Tyr Met 
370 375 380 

atg ate etc tgc atg ggc ttg ctg aaa gcg att ccg gac gat ttg tat 1200 
Met He Leu Cys Met Gly Leu Leu Lys Ala He Pro Asp Asp Leu Tyr 
385 390 395 400 

gaa gec tea gca atg gat ggc gca ggt ccg ttc cag aac ttc ttt aag 1248 
Glu Ala Ser Ala Met Asp Gly Ala Gly Pro Phe Gin Asn Phe Phe Lys 
405 410 415 

att acg ctg ccg ctg ctg att aaa ccg ctg acg ccg ctg atg ate gee 1296 
He Thr Leu Pro Leu Leu He Lys Pro Leu Thr Pro Leu Met He Ala 
420 425 430 

age ttc gee ttt aac ttt aac aac ttc gtg ctg att caa ctg tta acc 134 4 

Ser Phe Ala Phe Asn Phe Asn Asn Phe Val Leu He Gin Leu Leu Thr 
435 440 445 

aac ggc ggc ccg gat cgt ctt ggc acg acc acg cca gec ggt tat acc 1392 
Asn Gly Gly Pro Asp Arg Leu Gly Thr Thr Thr Pro Ala Gly Tyr Thr 
450 455 460 

gac ctg ctt gtt aac tac acc tac cgc ate get ttt gaa ggc ggc ggg 1440 
Asp Leu Leu Val Asn Tyr Thr Tyr Arg He Ala Phe Glu Gly Gly Gly 
465 470 475 480 

ggt cag gac ttc ggt ctg gcg gca gca att gec acg ctg ate ttc ctg 1488 
Gly Gin Asp Phe Gly Leu Ala Ala Ala He Ala Thr Leu He Phe Leu 
485 490 495 

ctg gtg ggt gcg ctg gcg ata gtg aac ctg aaa gee acg cga atg aag 1536 
Leu Val Gly Ala Leu Ala He Val Asn Leu Lys Ala Thr Arg Met Lys 
500 505 510 



ttt gat taa 
Phe Asp * 



1545 



<210> 156 
<211> 891 
<212> DNA 

<213> Escherichia coli 



<220> 
<221> CDS 
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<222> (1) . . . (891) 
<400> 156 

atg gca atg gtc caa ccg aaa teg caa aaa get cgt tta ttt att act 48 

Met Ala Met Val Gin Pro Lys Ser Gin Lys Ala Arg Leu Phe He Thr 

15 10 15 

cac ctg eta ctg eta ctt ttt ate gca gcg att atg ttc ccg ctg ctg 96 

His Leu Leu Leu Leu Leu Phe He Ala Ala He Met Phe Pro Leu Leu 

20 25 30 

atg gtc gtc get ate teg ctg cgt cag gga aac ttt gcg ace ggc age 144 

Met Val Val Ala He Ser Leu Arg Gin Gly Asn Phe Ala Thr Gly Ser 

35 40 45 

ctg ate ccg gag caa ate tec tgg gat cac tgg aaa ctg gcg tta ggt 192 

Leu He Pro Glu Gin He Ser Trp Asp His Trp Lys Leu Ala Leu Gly 

50 55 60 

ttt age gtt gaa cag get gat ggt cgc att acg cca ccg cca ttc ccg 240 

Phe Ser Val Glu Gin Ala Asp Gly Arg He Thr Pro Pro Pro Phe Pro 

65 70 75 80 

gta ctg ctg tgg ctg tgg aac teg gta aag gtc gee ggg att tec gcg 288 

Val Leu Leu Trp Leu Trp Asn Ser Val Lys Val Ala Gly He Ser Ala 

85 90 95 

att ggc att gtg gcg etc tec acc acc tgc gee tac get ttc gee cgt 336 

He Gly lie Val Ala Leu Ser Thr Thr Cys Ala Tyr Ala Phe Ala Arg 

100 105 110 

atg cgc ttt cca ggc aaa gcg acg ctg ctg aaa gga atg ctg att ttc 384 

Met Arg Phe Pro Gly Lys Ala Thr Leu Leu Lys Gly Met Leu He Phe 

115 "* 120 125 

cag atg ttc ccg gca gta ctt tea ctg gtc gcg ttg tat gcg ttg ttt 4 32 

Gin Met Phe Pro Ala Val Leu Ser Leu Val Ala Leu Tyr Ala Leu Phe 

130 135 140 

gat cgt ctg ggt gag tac att cca ttc att ggc ctg aat act cac ggc 480 

Asp Arg Leu Gly Glu Tyr He Pro Phe lie Gly Leu Asn Thr His Gly 

145 150 155 160 

ggc gta att ttc gcg tat ctg ggt ggg att gcg ctg cat gtc tgg acc 528 

Gly Val He Phe Ala Tyr Leu Gly Gly He Ala Leu His Val Trp Thr 

165 170 175 

ate aaa ggc tat ttc gaa acc ate gac agt teg ctg gaa gaa get get 576 

He Lys Gly Tyr Phe Glu Thr He Asp Ser Ser Leu Glu Glu Ala Ala 

180 185 190 

gcg ctg gat ggt gcg aca ccg tgg cag gee ttc cgc ctt gtc ctg ttg 624 

Ala Leu Asp Gly Ala Thr Pro Trp Gin Ala Phe Arg Leu Val Leu Leu 

195 200 205 

ccg ctg tea gta ccg att ctg gcg gtg gta ttc ate ctg teg ttt ate 672 

Pro Leu Ser Val Pro He Leu Ala Val Val Phe He Leu Ser Phe lie 

210 215 220 

get gee att act gaa gtt ccg gtc gcg teg ctg tta ctg cgt gac gta 720 
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Ala Ala lie Thr Glu Val Pro Val Ala Ser Leu Leu Leu Arg Asp Val 
225 230 235 240 

aac age tac acc ctg gcc gtg ggg atg cag caa tac etc aac ccg caa 768 
Asn Ser Tyr Thr Leu Ala Val Gly Met Gin Gin Tyr Leu Asn Pro Gin 
245 250 255 

aac tac ctg tgg ggt gac ttt gcc gcc get gcc gtg atg tct gca tta- 816 
Asn Tyr Leu Trp Gly Asp Phe Ala Ala Ala Ala Val Met Ser Ala Leu 
260 265 270 

ccg ate acc ate gtc ttc ttg ctg get caa cgc tgg ctg gtc aac ggc 8 64 

Pro Tie Thr He Val Phe Leu Leu Ala Gin Arg Trp Leu Val Asn Gly 
275 280 285 

ctg acg gca ggt ggt gtg aaa ggt taa 8 91 

Leu Thr Ala Gly Gly Val Lys Gly * 
290 295 



<210> 157 
<211> 2022 
<212> DNA 

<213> Escherichia coli 

<220> 

<221> CDS 

<222> (1) . . . (2022) 

<400> 157 

atg cgt eta aac ccc ggc caa caa caa get gtc gaa ttc gtt acc ggc 48 
Met Arg Leu Asn Pro Gly Gin Gin Gin Ala Val Glu Phe Val Thr Gly 
15 10 15 

ccc tgc ctg gtg ctg gcg ggc gcg ggt tec ggt aaa act cgt gtt ate 96 
Pro Cys Leu Val Leu Ala Gly Ala Gly Ser Gly Lys Thr Arg Val He 
20 25 30 

acc aat aaa ate gcc cat ctg ate cgc ggt tgc ggt tat cag gcg egg 144 
Thr Asn Lys He Ala His Leu He Arg Gly Cys Gly Tyr Gin Ala Arg 
35 40 . 45 

cac att gcg gcg gtg acc ttt act aat aaa gca gcg cgc gag atg aaa 192 
Hi's He Ala Ala Val Thr Phe Thr Asn Lys Ala Ala Arg Glu Met Lys 
50 55 60 

gag cgt gta ggg cag acg ctg ggg cgc aaa gag gcg cgt ggg ctg atg 240 
Glu Arg Val Gly Gin Thr Leu Gly Arg Lys Glu Ala Arg Gly Leu Met 
65 ~ 70 75 80. 

ate tec act ttc cat acg ttg ggg ctg gat ate ate aaa cgc gag tat 288 
He Ser Thr Phe His Thr Leu Gly Leu Asp He He Lys Arg Glu Tyr 
85 90 95. 

gcg gcg ctt ggg atg aaa gcg aac ttc teg ttg ttt gac gat acc gat 336 
Ala Ala Leu Gly Met Lys Ala Asn Phe Ser Leu Phe Asp Asp Thr Asp 
100 105 110 



cag ctt get ttg ctt aaa gag ttg acc gag ggg ctg att gaa gat gac 
Gin Leu Ala Leu Leu Lys Glu Leu Thr Glu Gly Leu lie Glu Asp Asp 



384 
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115 120 125 

aaa gtt etc ctg caa caa ctg att teg ace ate tct aac tgg aag aat 432 
Lys Val Leu Leu Gin Gin Leu lie Ser Thr lie Ser Asn Trp Lys Asn 
130 135 140 

gat etc aaa aca ccg tec cag gcg gca gca agt gcg att ggc gag egg 480 
Asp Leu Lys Thr Pro Ser Gin Ala Ala Ala Ser Ala lie Gly Glu Arg 
145 150 155 160 

gac cgt att ttt gec cat tgt tat ggg ctg tat gat gca cac ctg aaa 528 
Asp Arg lie Phe Ala His Cys Tyr Gly Leu Tyr Asp Ala His Leu Lys 
165 170 175 

gec tgt aac gtt etc gac ttc gat gat ctg att tta ttg ccg acg ttg 576 
Ala Cys Asn Val Leu Asp Phe Asp Asp Leu lie Leu Leu Pro Thr Leu 
180 185 190 

ctg ctg caa gec aat gaa gaa gtc cgc aag cgc tgg cag aac aaa att 624 
Leu Leu Gin Ala Asn Glu Glu Val Arg Lys Arg Trp Gin Asn Lys lie 
195 200 205 

cgc tat ctg ctg gtg gat gag tat cag gac acc aac acc age cag tat 672 
Arg Tyr Leu Leu Val Asp Glu Tyr Gin Asp Thr Asn Thr Ser Gin Tyr 
210 215 220 

gag ctg gtg aaa ctg ctg gtg ggc age cgc gcg cgc ttt acc gtg gtg 720 
Glu Leu Val Lys Leu Leu Val Gly Ser Arg Ala Arg Phe Thr Val Val 
225 230 235 240 

ggt gac gat gac cag teg ate tac tec tgg cgc ggt gca cgt ccg caa 768 
Gly Asp Asp Asp Gin Ser lie Tyr Ser Trp Arg Gly Ala Arg Pro Gin 
245 250 255 

aac ctg gtg ctg ctg agt cag gat ttt ccg gcg ctg aag gtg att aag 816 
Asn Leu Val Leu Leu Ser Gin Asp Phe Pro Ala Leu Lys Val He Lys 
260 265 270 

ctt gag cag aac tat cgc tct tec ggg cgt att ctg aaa gcg gcg aac 864 
Leu Glu Gin Asn Tyr Arg Ser Ser Gly Arg He Leu Lys Ala Ala Asn 
275 280 285 

ate ctg ate gec aat aac ccg cac gtc ttt gaa aag cgt ctg ttc tec 912 
lie Leu He Ala Asn Asn Pro His Val Phe Glu Lys Arg Leu Phe Ser 
290 295 300 

gaa ctg ggt tat ggc gcg gag eta aaa gta tta age gcg aat aac gaa 960 
Glu Leu Gly Tyr Gly Ala Glu Leu Lys Val Leu Ser Ala Asn Asn Glu 
305 310 315 320 

gaa cat gag get gag cgc gtt act ggc gag ctg ate gec cat cac ttc 1008 
Glu His Glu Ala Glu Arg Val Thr Gly Glu Leu He Ala His His Phe 
325 330 335 

gtc aat aaa acg cag tac aaa gat tac gec att ctt tat cgc ggt aac 1056 
Val Asn Lys Thr Gin Tyr Lys Asp Tyr Ala He Leu Tyr Arg Gly Asn 
340 345 350 

cat cag teg egg gtg ttt gaa aaa ttc ctg atg caa aac cgc ate ccg 1104 
His Gin Ser Arg Val Phe Glu Lys Phe Leu Met Gin Asn Arg He Pro 
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355 360 365 

tac aaa ata tct ggt ggt acg teg ttt ttc tct cgt cct gaa ate aag 1152 
Tyr Lys He Ser Gly Gly Thr Ser Phe Phe Ser Arg Pro Glu He Lys 
370 375 380 

gac ttg ctg get tat ctg cgc gtg ctg act aac ccg gac gat gac age 1200 
Asp Leu Leu Ala Tyr Leu Arg Val Leu Thr Asn Pro Asp Asp Asp Ser 
385 390 395 400 

gca ttt ctg cgt ate gtt aac acg ccg aag cga gag att ggc ccg get 1248 
Ala Phe Leu Arg He Val Asn Thr Pro Lys Arg Glu He Gly Pro Ala 
405 410 415 

acg ctg aaa aag ctg ggt gag tgg gcg atg acg cgc aat aaa age atg 1296 
Thr Leu Lys Lys Leu Gly Glu Trp Ala Met Thr Arg Asn Lys Ser Met 
420 425 430 

ttt acc gec age ttt gat atg ggc ctg agt cag acg ctt age gga cgt 1344 
Phe Thr Ala Ser Phe Asp Met Gly Leu Ser Gin Thr Leu Ser Gly Arg 
435 440 445 

ggt tat gaa gca ttg acc cgc ttc act cac tgg ttg gca gaa ate cag 1392 
Gly Tyr Glu Ala Leu Thr Arg Phe Thr His Trp Leu Ala Glu He Gin 
450 455 " 460 

cgt ctg gcg gag egg gag ccg att gec gcg gtg cgt gat ctg ate cat 14 40 

Arg Leu Ala Glu Arg Glu Pro Tie Ala Ala Val Arg Asp Leu He His 
465 470 475 480 

ggc atg gat tat gaa tec tgg ctg tac gaa aca teg ccc age ccg aaa 1488 
Gly Met Asp Tyr Glu Ser Trp Leu Tyr Glu Thr Ser Pro Ser Pro Lys 
485 490 495 

gec gec gaa atg cgc atg aag aac gtc aac caa ctg ttt age tgg atg 1536 
Ala Ala Glu Met Arg Met Lys Asn Val Asn Gin Leu Phe Ser Trp Met 
500 " 505 510 

acg gag atg ctg gaa ggc agt gaa ctg gat gag ccg atg acg etc acc 1584 
Thr Glu Met Leu Glu Gly Ser Glu Leu Asp Glu Pro Met Thr Leu Thr 
515 520 525 

cag gtg gtg acg cgc ttt act ttg cgc gac atg atg gag cgt ggt gag 1632 
Gin Val Val Thr Arg Phe Thr Leu Arg Asp Met Met Glu Arg Gly Glu 
530 535 540 

agt gaa gaa gag ctg gat cag gtg caa ctg atg act etc cac gcg teg 1680 
Ser Glu Glu Glu Leu Asp Gin Val Gin Leu Met Thr Leu His Ala Ser 
545 550 555 560 

aaa ggg ctg gag ttt cct tat gtc tac atg gtc ggt atg gaa gaa ggg 1728 
Lys Gly Leu Glu Phe Pro Tyr Val Tyr Met Val Gly Met Glu Glu Gly 
565 570 575 

ttt ttg ccg cac cag age age ate gat gaa gat aat ate gat gag gag 1776 
Phe Leu Pro His Gin Ser Ser He Asp Glu Asp Asn He Asp Glu Glu 
580 585 590 

egg egg ctg gec tat gtc ggc att acc cgc gec cag aag gaa ttg acc 1824 
Arg Arg Leu Ala Tyr Val Gly He Thr Arg Ala Gin Lys Glu Leu. Thr 
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595 600 605 

ttt acg ctg tgt aaa gaa cgc cgt cag tac ggc gaa ctg gtg cgc ccg 1872 
Phe Thr Leu Cys Lys Glu Arg Arg Gin Tyr Gly Glu Leu Val Arg Pro 
610 615 620 

gag ccg age cgc ttt ttg ctg gag ctg ccg cag gat gat ctg att tgg 1920 
Glu Pro Ser Arg Phe Leu Leu Glu Leu Pro Gin Asp Asp Leu He Trp 
625 630 635 640 

gaa cag gag cgc aaa gtg gtc age gec gaa gaa egg atg cag aaa ggg 1968 
Glu Gin Glu Arg Lys Val Val Ser Ala Glu Glu Arg Met Gin Lys Gly 
645 650 655 

caa age cat ctg gcg aat ctg aaa gcg atg atg gcg gca aaa cga ggg 2016 
Gin Ser His Leu Ala Asn Leu Lys Ala Met Met Ala Ala Lys Arg Gly 
660 665 670 

aaa taa 2022 
Lys * 



<210> 158 
<211> 1212 
<212> DNA 

<213> Escherichia coli 

<220> 

<221> CDS 

<222> (1) . - . (1212) 

<400> 158 

atg atg gtt aca gtt gtc age aat tat tgt caa tta tct caa acg caa 48 
Met Met Val Thr Val Val Ser Asn Tyr Cys Gin Leu Ser Gin Thr Gin 
15 10 15 

etc agt cag aca ttt gca gaa aaa ttt act gtg ace gag gaa tta ctg 96 
Leu Ser Gin Thr Phe Ala Glu Lys Phe Thr Val Thr Glu Glu Leu Leu 
20 25 30 

cag tct tta aaa aaa aca gcg tta tec gga gat gaa gaa age att gag 144 
Gin Ser Leu Lys Lys Thr Ala Leu Ser Gly Asp Glu Glu Ser He Glu 
35 40 45 

tta ctg cat aat att gcg tta ggt tat gat aaa ttt ggg aaa gaa get 192 
Leu Leu His Asn He Ala Leu Gly Tyr Asp Lys Phe Gly Lys Glu Ala 
50 55 60 

gaa gat att ctt tac cat att gtt aga ace cca aca aat gag ace eta 240 
Glu Asp He Leu Tyr His He Val Arg Thr Pro Thr Asn Glu Thr Leu 
65 70 75 80 

teg att ate cga ctt ate aaa aat get tgt tta aaa tta tat aat ctg 288 
Ser He He Arg Leu He Lys Asn Ala Cys Leu Lys Leu Tyr Asn Leu 
85 90 95 

gca cat ate gca ace aac tec ccc etc aaa tea cat gat tea gat gat 336 
Ala His He Ala Thr Asn Ser Pro Leu Lys Ser His Asp Ser Asp Asp 
100 105 110 
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etc ctg ttc aaa aaa eta ttc tec cct teg aaa tta atg aca att ate 384 
Leu Leu Phe Lys Lys Leu Phe Ser Pro Ser Lys Leu Met Thr lie lie 
115 120 125 

ggt gat gaa att cct ctt ata tct gaa aaa cag teg ctt tea aag gtg 432 
Gly Asp Glu He Pro Leu He Ser Glu Lys Gin Ser Leu Ser Lys Val 
130 135 140 

ctt tta aat gat gag aat aat gaa ctg agt gat ggt aca aac ttc tgg 480 
Leu Leu Asn Asp Glu Asn Asn Glu Leu Ser Asp Gly Thr Asn Phe Trp 
145 150 155 160 



gat aaa aat cgt caa tta acc aca gat gaa ata get tgc tat ctt cag 528 
Asp Lys Asn Arg Gin Leu Thr Thr Asp Glu He Ala Cys Tyr Leu Gin 
165 170 * 175 

aag ate gee get aat gca aaa aat act caa gtc aat tat cct act ggt 576 
Lys He Ala Ala Asn Ala Lys Asn Thr Gin Val Asn Tyr Pro Thr Gly 
180 185 190 

etc tac gtc ccc tac tec acc aga act cac ctg gaa gac get etc aat 624 
Leu Tyr Val Pro Tyr Ser Thr Arg Thr His Leu Glu Asp Ala Leu Asn 
195 200 205 

gaa aat att aag age gat cca tea tgg ccg aat gaa gtc cag tta ttc 672 
Glu Asn He Lys Ser Asp Pro Ser Trp Pro Asn Glu Val Gin Leu Phe 
210 215 220 

ccc ata aat act ggc gga cac tgg ata tta gtt teg eta cag aaa ata 720 
Pro He Asn Thr Gly Gly His Trp He Leu Val Ser Leu Gin Lys He 
225 230 235 240 

gta aat aaa aaa aat aat aaa eta caa ata aaa tgc gtc ata ttc aac 768 
Val Asn Lys Lys Asn Asn Lys Leu Gin He Lys Cys Val He Phe Asn 
245 250 255 

tea ttg cgt gca eta ggc tat gat aaa gaa aat tea ctt aag cgt gtc 816 
Ser Leu Arg Ala Leu Gly Tyr Asp Lys Glu Asn Ser Leu Lys Arg Val 
260 265 270 

att aat agt ttt aat tct gaa etc atg gga gaa atg teg aat aat aat 8 64 

lie Asn Ser Phe Asn Ser Glu Leu Met Gly Glu Met Ser Asn Asn Asn 
275 280 285 

ata aaa gtt cat tta aat gaa cca gag ata ata ttt tta cat gec gat 912 
He Lys Val His Leu Asn Glu Pro Glu lie lie Phe Leu His Ala Asp 
290 295 300 

ctt cag caa tac tta age caa agt tgc ggt gca ttt gtg tgc atg gca 960 
Leu Gin Gin Tyr Leu Ser Gin Ser Cys Gly Ala Phe Val Cys Met Ala 
305 310 315 320 

gee cag gaa gtg att gaa caa agg gaa age aat tct gac age gee ccc 1008 
Ala Gin Glu Val He Glu Gin Arg Glu Ser Asn Ser Asp. Ser Ala Pro 
325 330 335 

tat acg tta tta aaa aac cat get gac aga ttt aaa aaa tat tea gca 1056 
Tyr Thr Leu Leu Lys Asn His Ala Asp Arg Phe Lys Lys Tyr Ser Ala 
34 0 345 350 
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gaa gag cag tac gaa att 
Glu Glu Gin Tyr Glu He 
355 

tgt tat tta gat aaa tat 
Cys Tyr Leu Asp Lys Tyr 
370 

aac tta gaa ata, aaa cac 
Asn Leu Glu lie Lys His 
385 390 

aga gtg agt taa 
Arg Val Ser * 



gat ttt caa cat cga ctg 
Asp Phe Gin His Arg Leu 
360 

ggc gat gca aat ate aat 
Gly Asp Ala Asn He Asn 
375 380 

tea caa ecc aaa aat aga 
Ser Gin Pro Lys Asn Arg 
395 



gca aac aga aat 1104 

Ala Asn Arg Asn 

365 

cat tat tat aga 1152 
His Tyr Tyr Arg 



gca tec ggc aaa 1200 
Ala Ser Gly Lys 
400 

1212 



<210> 159 
<211> 891 
<212> DNA 

<213> Escherichia coli 

<220> 

<221> CDS 

<222> (1) . . . (891) 

<400> 159 

atg atg ttt aag caa tac ctg caa gta acg aaa cca ggc ate ate ttt 48 

Met Met Phe Lys Gin Tyr Leu Gin Val Thr Lys Pro Gly He He Phe 
15 10 15 

ggc aac ctg ate teg gtg att ggg gga ttc ctg ctg gee tea aag ggc 96 
Gly Asn Leu He Ser Val He Gly Gly Phe Leu Leu Ala Ser Lys Gly 
20 25 30 

age att gat tat ccc ctg ttt ate tac acg ctg gtt ggg gtg tea ctg 144 
Ser He Asp Tyr Pro Leu Phe lie Tyr Thr Leu Val Gly Val Ser Leu 
35 40 45 

gtt gtg gcg teg ggt tgt gtg ttt aac aac tac ate gac agg gat ate 192 
Val Val Ala Ser Gly Cys Val Phe Asn Asn Tyr He Asp Arg Asp He 
50 55 60 

gac aga aag atg gaa agg acg aag aat egg gtg ctg gtg aaa ggc ctg 240 
Asp Arg Lys Met Glu Arg Thr Lys Asn Arg Val Leu Val Lys Gly Leu 
65 70 75 80 

ate tct cct get gtc teg ctg gtg tac gee acg ttg ctg ggt att get 288 
He Ser Pro Ala Val Ser Leu Val Tyr Ala Thr Leu Leu Gly He Ala 
85 90 95 

ggc ttt atg ctg ctg tgg ttt ggc gcg aat ccg ctg gee tgc tgg ctg 336 
Gly Phe Met Leu Leu Trp Phe Gly Ala Asn Pro Leu Ala Cys Trp Leu 
100 105 110 

ggg gtg atg ggc ttt gtg gtt tat gtc ggc gtt tat age ctg tac atg 384 
Gly Val Met Gly Phe Val Val Tyr Val Gly Val Tyr Ser Leu Tyr Met 
115 120 125 
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aaa cgc cac tct gtc tac ggc acg ttg att ggt teg etc tec ggc get 4 32 

Lys Arg His Ser Val Tyr Gly Thr Leu lie Gly Ser Leu Ser Gly Ala 
130 135 140 

gcg ccg ccg gtg ate ggc tac tgt gcg gta ace ggt gag ttc gat age 4 80 

Ala Pro Pro Val lie Gly Tyr Cys Ala Val Thr Gly Glu Phe Asp Ser 
145 150 155 160 

ggc gca gcg ate ctg ctg get ate ttc age ctg tgg cag atg cct cac 528 
Gly Ala Ala He Leu Leu Ala He Phe Ser Leu Trp Gin Met Pro His 
165 170 175 

tec tat gee ate gee att ttc cgc ttt aag gat tac cag gcg gca aac 576 
Ser Tyr Ala He Ala He Phe Arg Phe Lys Asp Tyr Gin Ala Ala Asn 
180 185 190 

att ccg gta ttg cca gtg gta aaa ggc att teg gtg gcg aag aat cac 624 
He Pro Val Leu Pro Val Val Lys Gly He Ser Val Ala Lys Asn His 
195 200 205 

ate acg ctg tat ate ate gee ttt gee gtt gee acg ctg atg etc tct 672 
He Thr Leu Tyr He He Ala Phe Ala Val Ala Thr Leu Met Leu Ser 
210 215 220 

ctt ggc ggt tac get ggg tat aaa tat ctg gtg gtc gec gcg gcg gtt 720 
Leu Gly Gly Tyr Ala Gly Tyr Lys Tyr Leu Val Val Ala Ala Ala Val 
225 230 235 240 

age gtc tgg tgg tta ggt atg get ctg cgc ggt tat aaa gtt get gat 768 
Ser Val Trp Trp Leu Gly Met Ala Leu Arg Gly Tyr Lys Val Ala Asp 
245 ' 250 255 

gac aga ate tgg gcg cgc aag ctg ttc ggc ttc tct ate ate gee ate 816 
Asp Arg He Trp Ala Arg Lys Leu Phe Gly Phe Ser He He Ala He 
260 265 270 

act gee etc teg gtg atg atg tec gtt gat ttt atg gta ccg gac teg 864 
Thr Ala Leu Ser Val Met Met Ser Val Asp Phe Met Val Pro Asp Ser 
275 280 285 

cat acg ctg ctg get get gtg tgg taa 891 
His Thr Leu Leu Ala Ala Val Trp * 
290 295 



<210> 160 
<211> 948 
<212> DNA 

<213> Escherichia coli 

<220> 

<221> CDS 

<222> (1) . . . (948) 

<400> 160 

atg aga etc agg aaa tac aat aaa agt ttg gga tgg ttg tea tta ttt 48 
Met Arg Leu Arg Lys Tyr Asn Lys Ser Leu Gly Trp Leu Ser Leu Phe 
15 10 15 

gca ggc act gta ttg etc agt ggc tgt aat tct gcg ctg tta gat ccc 96 
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Ala Gly Thr Val Leu Leu Ser Gly Cys Asn Ser Ala Leu Leu Asp Pro 

20 25 30 

aaa gga cag att ggt ctg gag caa cgt tea ctg ata ctg acg gca ttt 144 

Lys Gly Gin lie Gly Leu Glu Gin Arg Ser Leu lie Leu Thr Ala Phe 

35 ~ 40 45 

ggc ctg atg ttg att gtc gtt att ccc gca ate ttg atg get gtt ggt 192 

Gly Leu Met Leu lie Val Val He Pro Ala He Leu Met Ala Val Gly 

50 55 60 

ttc gee tgg aag tac cgt gcg age aat aaa gat get aag tac age ccg 240 

Phe Ala Trp Lys Tyr Arg Ala Ser Asn Lys Asp Ala Lys Tyr Ser Pro 

65 70 75 80 

aac tgg tea cac tec aat aaa gtg gaa get gtg gtc tgg acg gta cct 288 

Asn Trp Ser His Ser Asn Lys Val Glu Ala Val Val Trp Thr Val Pro 

85 90 95 

ate tta ate ate ate ttc ctt gca gta ctg acc tgg aaa ace act cac 336 

He Leu He He He Phe Leu Ala Val Leu Thr Trp Lys Thr Thr His 

100 105 110 

get ctt gag cct age aag ccg ctg gca cac gac gag aag ccc att acc 384 

Ala Leu Glu Pro Ser Lys Pro Leu Ala His Asp Glu Lys Pro lie Thr 

115 120 125 

ate gaa gtg gtt tec atg gac tgg aaa tgg ttc ttc ate tac ccg gaa 432 

He Glu Val Val Ser Met Asp Trp Lys Trp Phe Phe He Tyr Pro Glu 

130 135 140 

cag ggc att get acc gtg aat gaa ate get ttc ccg gcg aac act ccg 4 80 

Gin Gly He Ala Thr Val Asn Glu He Ala Phe Pro Ala Asn Thr Pro 

145 " 150 155 160 

gtg tac ttc aaa gtg acc tec aac tec gtg atg aac tec ttc ttc att 528 

Val Tyr Phe Lys Val Thr Ser Asn Ser Val Met Asn Ser Phe Phe He 

165 170 175 

ccg cgt ctg ggt age cag att tat gee atg gee ggt atg cag act cgc 576 

Pro Arg Leu Gly Ser Gin He Tyr Ala Met Ala Gly Met Gin Thr Arg 

180 185 190 

ctg cat ctg ate gee aac gaa ccc ggc act tat gac ggt ate tec gee 624 

Leu His Leu He Ala Asn Glu Pro Gly Thr Tyr Asp Gly He Ser Ala 

195 200 205 

age tac age ggc ccg ggc ttc tea ggc atg aag ttc aaa get att gca 672 

Ser Tyr Ser Gly Pro Gly Phe Ser Gly Met Lys Phe Lys Ala He Ala 

210 215 220 

aca ccg gat cgc gec gca ttc gac cag tgg gtc gca aaa gcg aag cag 720 

Thr Pro Asp Arg Ala Ala Phe Asp Gin Trp Val Ala Lys Ala Lys Gin 

225 230 235 240 

teg ccg aac acc atg tct gac atg get gcg ttc gaa aaa ctg gec gcg 768 

Ser Pro Asn Thr Met Ser Asp Met Ala Ala Phe Glu Lys Leu Ala Ala 

245 250 255 

cct age gaa tac aac cag gtg gaa tat ttc tec aac gtg aaa cca gac 816 
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Pro Ser Glu Tyr Asn Gin 
260 

ttg ttt gcc gat gta att 
Leu Phe Ala Asp Val lie 
275 

gac atg acc cag cca gaa 
Asp Met Thr Gin Pro Glu 
290 

ggc atg gac atg age cac 
Gly Met Asp Met Ser His 
305 310 



Val Glu Tyr Phe Ser Asn 
265 

aac aag ttt atg get cac 
Asn Lys Phe Met Ala His 
280 

ggt gag cac age gca cac 
Gly Glu His Ser Ala His 
295 300 

gcg gaa tec gcc cat taa 
Ala Glu Ser Ala His * 
315 



Val Lys Pro Asp 
270 

ggt aag age atg 864 

Gly Lys Ser Met 

285 

gaa ggt atg gaa 912 
Glu Gly Met Glu 



948 



<210> 161 
<211> 1992 
<212> DNA 

<213> Escherichia coli 

<220> 

<221> CDS 

<222> (1) . . . (1992) 

<400> 161 

atg ttc gga aaa tta tea ctt gat gca gtc ccg ttc cat gaa cct ate 48 

Met Phe Gly Lys Leu Ser Leu Asp Ala Val Pro Phe His Glu Pro lie 
15 10 15 

gtc atg gtt acg ate get ggc att att ttg gga ggt ctg gcg etc gtt 96 
Val Met Val Thr lie Ala Gly lie He Leu Gly Gly Leu Ala Leu Val 
20 25 30 

ggc ctg ate act tac ttc ggt aag tgg acc tac ctg tgg aaa gag tgg 144 
Gly Leu He Thr Tyr Phe Gly Lys Trp Thr Tyr Leu Trp Lys Glu Trp 
35 40 45 

ctg acc tec gtc gac cat aaa cgc etc ggt ate atg tat ate ate gtg 192 
Leu Thr Ser Val Asp His Lys Arg Leu Gly He Met Tyr lie He Val 
50 * 55 60 

gcg att gtg atg ttg ctg cgt ggt ttt get gac gcc att atg atg cgt 240 
Ala He Val Met Leu Leu Arg Gly Phe Ala Asp Ala lie Met Met Arg 
65 70 75 80 

age cag cag get ctt gcc teg gcg ggc gaa gcg ggc ttc ctg cca cct 288 
Ser Gin Gin Ala Leu Ala Ser Ala Gly Glu Ala Gly Phe Leu Pro Pro 
85 90 95 

cac cac tac gat cag ate ttt acc gcg cac ggc gtg att atg ate ttc 336 
His His Tyr Asp Gin He Phe Thr Ala His Gly Val He Met He Phe 
100 105 110 

ttc gtg gcg atg cct ttc gtt ate ggt ctg atg aac ctg gtg gtt ccg 384 
Phe Val Ala Met Pro Phe Val He Gly Leu Met Asn Leu Val Val Pro 
115 120 125 

ctg cag ate ggc gcg cgt gac gtt gcg ttc ccg ttc etc aac aac tta 432 
Leu Gin He Gly Ala Arg Asp Val Ala Phe Pro Phe Leu Asn Asn Leu 
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130 135 140 

age ttc tgg ttt acc gtt gtt ggt gtg att ctg gtt aac gtt tct etc 4 80 

Ser Phe Tip Phe Thr Val Val Gly Val He Leu Val Asn Val Ser Leu 
145 150 155 160 

ggc gtg ggc gaa ttt gcg cag acc ggc tgg ctg gec tat cca ccg eta 528 

Gly Val Gly Glu Phe Ala Gin Thr Gly Trp Leu Ala Tyr Pro Pro Leu 
165 170 175 

teg gga ata gag tac agt ccg gga gtc ggt gtc gat tac tgg ata tgg 57 6 

Ser Gly He Glu Tyr Ser Pro Gly Val Gly Val Asp Tyr Trp He Trp 
180 185 190 

agt etc cag eta tec ggt ata ggt acg acg ctt acc ggt ate aac ttc 624 

Ser Leu Gin Leu Ser Gly He Gly Thr Thr Leu Thr Gly He Asn Phe 
195 200 205 

ttc gtt acc att ctg aag atg cgc gca ccg ggc atg acc atg ttc aag 672 

Phe Val Thr He Leu Lys Met Arg Ala Pro Gly Met Thr Met Phe Lys 

210 215 220 

atg cca gta ttt acc tgg gca tea ctg tgc gcg aac gta ctg att att 720 

Met Pro Val Phe Thr Trp Ala Ser Leu Cys Ala Asn Val Leu He He 
225 230 235 240 

get tec ttc cca att ctg acg gtt acc gtc gcg ttg ttg acc ctg gat 7 68 

Ala Ser Phe Pro He Leu Thr Val Thr Val Ala Leu Leu Thr Leu Asp 
245 250 255 

cgc tat ctg ggc acc cat ttc ttt acc aac gat atg ggt ggc aac atg 816 

Arg Tyr Leu Gly Thr His Phe Phe Thr Asn Asp Met Gly Gly Asn Met 
260 265 270 

atg atg tac ate aac ctg att tgg gec tgg ggc cac ccg gaa gtt tac 8 64 

Met Met Tyr He Asn Leu He Trp Ala Trp Gly His Pro Glu Val Tyr 
275 280 285 

ate ctg ate ctg cct gtt ttc ggt gtg ttc tec gaa att gcg gca acc 912 

He Leu He Leu Pro Val Phe Gly Val Phe Ser Glu He Ala Ala Thr 

290 295 300 

ttc teg cgt aaa cgt ctg ttt ggt tat acc teg ctg gta tgg gca acc 960 

Phe Ser Arg Lys Arg Leu Phe Gly Tyr Thr Ser Leu Val Trp Ala Thr 
305 310 315 320 

gtc tgt ate acc gtg ctg teg ttc ate gtt tgg ctg cac cac ttc ttt 1008 

Val Cys He Thr Val Leu Ser Phe He Val Trp Leu His His Phe Phe 
325 330 335 

acg atg ggt gcg ggc gcg aac gta aac gec ttc ttt ggt ate acc aca 1056 

Thr Met Gly Ala Gly Ala Asn Val Asn Ala Phe Phe Gly He Thr Thr 
340 345 350 



atg att ate gec ate ccg acc ggg gtg aag ate ttc aac tgg ctg ttc 1104 
Met He He Ala He Pro Thr Gly Val Lys He Phe Asn Trp Leu Phe 
355 360 365 

acc atg tat cag ggc cgc ate gtg ttc cat tct gcg atg ctg tgg acc 1152 
Thr Met Tyr Gin Gly Arg He Val Phe His Ser Ala Met Leu Trp Thr 
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370 375 380 

ate ggt ttt ate gtc acc ttc teg gtg ggc ggg atg act ggc gtg ctg 1200 
He Gly Phe He Val Thr Phe Ser Val Gly Gly Met Thr Gly Val Leu 
385 * 390 395 400 

ctg gee gta ccg ggc gcg gac ttc gtt ctg cat aac age ctg ttc ctg 1248 
Leu Ala Val Pro Gly Ala Asp Phe Val Leu His Asn Ser Leu Phe Leu 
405 410 415 

att gcg cac ttc cat aac gtg ate ate ggc ggc gtg gtc ttc ggc tgc 1296 
He Ala His Phe His Asn Val He He Gly Gly Val Val Phe Gly Cys 
420 425 430 

ttc gca ggg atg acc tac tgg tgg cct aaa gcg ttc ggt ttc aaa ctg 1344 
Phe Ala Gly Met Thr Tyr Trp Trp Pro Lys Ala Phe Gly Phe Lys Leu 
435 440 445 

aac gaa acc tgg ggt aaa cgc gcg ttc tgg ttc tgg ate ate ggc ttc 1392 
Asn Glu Thr Trp Gly Lys Arg Ala Phe Trp Phe Trp He He Gly Phe 
450 455 460 

ttc gtt gec ttt atg cca ctg tat gcg ctg ggc ttc atg ggc atg acc 1440 
Phe Val Ala Phe Met Pro Leu Tyr Ala Leu Gly Phe Met Gly Met Thr 
465 470 475 480 

cgt cgt ttg age cag cag att gac ccg cag ttc cac acc atg ctg atg 14 88 

Arg Arg Leu Ser Gin Gin He Asp Pro Gin Phe His Thr Met Leu Met 
485 490 495 

att gca gee age ggt gca gta ctg att gcg ctg ggt att etc tgc etc 1536 
He Ala Ala Ser Gly Ala Val Leu He Ala Leu Gly He Leu Cys Leu 
500 * 505 510 

gtt att cag atg tac gtt tct att cgc gac cgc gac cag aac cgt gac 1584 
Val He Gin Met Tyr Val Ser He Arg Asp Arg Asp Gin Asn Arg Asp 
515 520 525 

ctg act ggc gac ccg tgg ggt ggc cgt acg ctg gag tgg gca acc tct 1632 
Leu Thr Gly Asp Pro Trp Gly Gly Arg Thr Leu Glu Trp Ala Thr Ser 
530 535 540 

tec ccg cct ccg ttc tat aac ttt gec gta gtg ccg cac gtt cac gaa 1680 
Ser Pro Pro Pro Phe Tyr Asn Phe Ala Val Val Pro His Val His Glu 
545 550 555 560 

cgt gat gca ttc tgg gaa atg aaa gag aaa ggc gaa gcg tat aaa aag 1728 
Arg Asp Ala Phe Trp Glu Met Lys Glu Lys Gly Glu Ala Tyr Lys Lys 
565 570 575 

cct gac cac tat gaa gaa att cat atg ccg aaa aac age ggt gca ggt 1776 
Pro Asp His Tyr Glu Glu lie His Met Pro Lys Asn Ser Gly Ala Gly 
580 585 590 

ate gtc att gca get ttc tec acc ate ttc ggt ttc gee atg ate tgg , 1824 
He Val He Ala Ala Phe Ser Thr He Phe Gly Phe Ala Met He Trp 
595 600 605 

cat ate tgg tgg ctg gcg att gtt ggc ttc gca ggc atg ate ate acc 1872 
His He Trp Trp Leu Ala He Val Gly Phe Ala Gly Met He He Thr 
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610 615 620 

tgg ate gtg aaa age ttc gac gag gac gtg gat tac tac gtg ccg gtg 1920 
Trp lie Val Lys Ser Phe Asp Glu Asp Val Asp Tyr Tyr Val Pro Val 
625 630 635 640 

gca gaa ate gaa aaa ctg gaa aac cag cat ttc gat gag att act aag 1968 
Ala Glu lie Glu Lys Leu Glu Asn Gin His Phe Asp Glu He Thr Lys 
645 650 655 

gca ggg ctg aaa aat ggc aac tga 1992 
Ala Gly Leu Lys Asn Gly Asn * 
660 



<210> 162 
<211> 615 
<212> DNA 

<213> Escherichia coli 

<220> 

<221> CDS 

<222> (1) . . . (615) 

<400> 162 

atg gca act gat act ttg acg cac gcg act gec cac gcg cac gaa cac 48 

Met Ala Thr Asp Thr Leu Thr His Ala Thr Ala His Ala His Glu His 
15 10 15 

ggg cac cac gat gca ggc gga acc aaa ate ttc gga ttt tgg ate tac' 96 
Gly His His Asp Ala Gly Gly Thr Lys He Phe Gly Phe Trp He Tyr 
20 25 30 

ctg atg age gac tgc att ctg ttc tct ate ttg ttt get acc tat gec 144 
Leu Met Ser Asp Cys He Leu Phe Ser He Leu Phe Ala Thr Tyr Ala 
35 40 4b 

gtt ctg gtg aac ggc acc gca ggc ggc ccg aca ggt aag gac att ttc 192 
Val Leu Val Asn Gly Thr Ala Gly Gly Pro Thr Gly Lys Asp He Phe 
50 55 60 

gaa ctg ccg ttc gtt ctg gtt gaa act ttc ttg ctg ttg ttc age tec 240 
Glu Leu Pro Phe Val Leu Val Glu Thr Phe Leu Leu Leu Phe Ser Ser 
65 70 75 80 

ate acc tac ggc atg gcg get ate gec atg tac aaa aac aac aaa age 288 
He Thr Tyr Gly Met Ala Ala He Ala Met Tyr Lys Asn Asn Lys Ser 
85 90 95 

cag gtt ate tec tgg ctg gcg ttg acc tgg ttg ttt ggt gee gga ttt 336 
Gin Val He Ser Trp Leu Ala Leu Thr Trp Leu Phe Gly Ala Gly Phe 
100 105 110 

ate ggg atg gaa ate tat gaa ttc cat cac ctg att gtt aac ggc atg 384 
He Gly Met Glu He Tyr Glu Phe His His Leu He Val Asn Gly Met 
115 120 125 

ggt ccg gat cgc age ggc ttc ctg tea gcg ttc ttt gcg ctg gtc ggc 432 
Gly Pro Asp Arg Ser Gly Phe Leu Ser Ala Phe Phe Ala Leu Val Gly 
130 135 140 
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acg cac ggt ctg cac gtc act tct ggt ctt ate tgg atg gcg gtg ctg 480 
Thr His Gly Leu His Val Thr Ser Gly Leu He Trp Met Ala Val Leu 
145 150 155 160 

atg gtg caa ate gcc cgt cgc ggc ctg acc age act aac cgt acc cgc 528 
Met Val Gin He Ala Arg Arg Gly Leu Thr Ser Thr Asn Arg Thr Arg 
165 170 175 

ate atg tgc ctg age ctg ttc tgg cac ttc ctg gat gtg gtt tgg ate 576 
He Met Cys Leu Ser Leu Phe Trp His Phe Leu Asp Val Val Trp He 
180 185 190 

tgt gtg ttc act gtt gtt tat ctg atg ggg gcg atg taa 615 
Cys Val Phe Thr Val Val Tyr Leu Met Gly Ala Met * 
195 200 



<210> 163 
<211> 330 
<212> DNA 

<213> Escherichia coli 

<220> 

<221> CDS 

<222> (1) . . . (330) 

<400> 163 

atg agt cat tct acc gat cac age ggc gcg tec cat ggc age gta aaa 48 

Met Ser His Ser Thr Asp His Ser Gly Ala Ser His Gly Ser Val Lys 
15 10 15 

acc tac atg aca ggc ttt ate ctg teg ate att ctg acg gtg att ccg 96 
Thr Tyr Met Thr Gly Phe He Leu Ser He He Leu Thr Val He Pro 
20 25 30 

ttc tgg atg gtg atg aca gga get gcc tct ccg gcc gta att ctg gga 144 
Phe Trp Met Val Met Thr Gly Ala Ala Ser Pro Ala Val He Leu Gly 
35 40 45 

aca ate ctg gca atg gca gtg gta cag gtt ctg gtg cat ctg gtg tgc 192 
Thr He Leu Ala Met Ala Val Val Gin Val Leu Val His Leu Val Cys 
50 55 60 

ttc ctg cac atg aat acc aaa tea gat gaa ggc tgg aac atg acg gcg 240 
Phe Leu His Met Asn Thr Lys Ser Asp Glu Gly Trp Asn Met Thr Ala 
65 70 75 80 

ttt gtc ttc acc gtg eta ate ate get ate ctg gtt gta ggc tec ate 288 
Phe Val Phe Thr Val Leu He He Ala He Leu Val Val Gly Ser He 
85 90 95 

tgg att atg tgg aac etc aac tac aac atg atg atg cac taa 330 
Trp He Met Trp Asn Leu Asn Tyr Asn Met Met Met His * 
100 105 



<210> 164 
<211> 1764 
<212> DNA 
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<213> Escherichia coli 

<220> 

<221> CDS 

<222> (1) . . . (1764) 

<400> 164 

atg cag tgg caa act aaa etc ccg ctg ate gec att ttg cgc ggt att 48 

Met Gin Trp Gin Thr Lys Leu Pro Leu lie Ala lie Leu Arg Gly lie 

15 10 15 

acg ccc gac gag gcg ctg gcg cat gtt ggc gcg gtg att gac gec ggg 96 

Thr Pro Asp Glu Ala Leu Ala His Val Gly Ala Val He Asp Ala Gly 

20 25 30 

ttc gac gcg gtt gaa ate ccg ctg aat tec cca caa tgg gag caa age 144 

Phe Asp Ala Val Glu He Pro Leu Asn Ser Pro Gin Trp Glu Gin Ser 

35 40 45 

att ccc gee ate gtt gat gcg tac ggc gac aag gcg ttg att ggc gca 192 

He Pro Ala lie Val Asp Ala Tyr Gly Asp Lys Ala Leu lie Gly Ala 

50 55 60 

ggt acg gta ctg aaa cct gaa cag gtc gat gcg etc gee agg atg ggc 240 

Gly Thr Val Leu Lys Pro Glu Gin Val Asp Ala Leu Ala Arg Met Gly 

65 70 75 " 80 

tgt cag etc ate gtt acg ccc aat ate cat agt gaa gtg ate cgc cgt 288 

Cys Gin Leu lie Val Thr Pro Asn He His Ser Glu Val He Arg Arg 

85 90 95 

gcg gtg ggc tac ggc atg ace gtc tgc ccc ggc tgc gcg acg gcg acc 336 

Ala Val Gly Tyr Gly Met Thr Val Cys Pro Gly Cys Ala Thr Ala Thr 

100 105 110 

gaa gee ttt acc gcg etc gaa gcg ggc gec gca ggc get gaa aat att 384 

Glu Ala Phe Thr Ala Leu Glu Ala Gly Ala Ala Gly Ala Glu Asn He 

115 120 125 

tec gtc ate ggc ttt tgg tec gca ata cat caa age gtt aaa age ggt 4 32 

Ser Val He Gly Phe Trp Ser Ala He His Gin Ser Val Lys Ser Gly 

130 135 140 

att gee ate gga cat cgc agt ctt tgc cgt tgg egg cgt gac gca gaa 4 80 

He Ala He Gly His Arg Ser Leu Cys Arg Trp Arg Arg Asp Ala Glu 

145 150 155 160 

aac ctg gcg cag tgg ata gac gca ggt tgt gca ggg gcg ggc tta ggc 528 

Asn Leu Ala Gin Trp He Asp Ala Gly Cys Ala Gly Ala Gly Leu Gly 

165 170 175 

age gat etc tat cgc gee ggg caa tec gta gag cgc acc gcg cag cag 576 

Ser Asp Leu Tyr Arg Ala Gly Gin Ser Val Glu Arg Thr Ala Gin Gin 

180 185 190 

gca gca gca ttt gtt aag gcg tat cga gag gca ggt gca atg aaa ate 624 

Ala Ala Ala Phe Val Lys Ala Tyr Arg Glu Ala Gly Ala Met Lys He 

195 200 205 

acc aaa att acc acg tat cgt tta cct ccc cgc tgg atg ttc ctg aaa 672 
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Thr Lys lie Thr Thr Tyr Axg Leu Pro Pro Arg Trp Met Phe Leu Lys 
210 215 220 

att gaa acc gat gaa ggc gtg gtc ggt tgg ggc gag ccc gtg ate gaa 720 
He Glu Thr Asp Glu Gly Val Val Gly Trp Gly Glu Pro Val He Glu 
225 230 235 240 

ggc cgc gec cgt acg gtg gaa gcg gca gtt cac gag ctg ggt gac tat 768 
Gly Arg Ala Arg Thr Val Glu Ala Ala Val His Glu Leu Gly Asp Tyr 
245 250 255 

ttg att ggt cag gat cca teg cgc ate aat gac tta tgg caa gtg atg . 816 
Leu He Gly Gin Asp Pro Ser Arg lie Asn Asp Leu Trp Gin Val Met 
260 265 270 

tat cgc gee gga ttc tat cgc ggc ggt ccg ate ctg atg age gee ate 864 
Tyr Arg Ala Gly Phe Tyr Arg Gly Gly Pro He Leu Met Ser Ala He 
275 280 285 

gee ggg att gac cag gcg tta tgg gat ate aaa ggt aaa gtg ctg aat 912 
Ala Gly He Asp Gin Ala Leu Trp Asp lie Lys Gly Lys Val Leu Asn 
290 295 300 

gcg ccg gtc tgg caa ctg atg ggc ggc ctg gtt cgc gac aaa att aaa 960 
Ala Pro Val Trp Gin Leu Met Gly Gly Leu Val Arg Asp Lys He Lys 
305 310 315 320 

gee tac agt tgg gtt ggc ggc gat cgt ccg gcg gat gtt ate gac ggc 1008 
Ala Tyr Ser Trp Val Gly Gly Asp Arg Pro Ala Asp Val He Asp Gly 
325 330 335 

att aaa acg eta cgc gaa ate ggc ttc gat acc ttc aaa ctg aac ggt 1056 
He Lys Thr Leu Arg Glu He Gly Phe Asp Thr Phe Lys Leu Asn Gly 
340 345 350 

tgt gaa gaa ctg ggg eta att gat aac tec cgc gcg gta gat gcg gcg 1104 
Cys Glu Glu Leu Gly Leu He Asp Asn Ser Arg Ala Val Asp Ala Ala 
355 360 365 

gtt aac acc gtg gca caa att cgt gaa get ttt ggc aat cag att gag 1152 
Val Asn Thr Val Ala Gin He Arg Glu Ala Phe Gly Asn Gin He Glu 
370 375 380 

ttt ggt ctt gat ttc cac ggt cgc gtc age gcg ccg atg gcg aaa gtg 1200 
Phe Gly Leu Asp Phe His Gly Arg Val Ser Ala Pro Met Ala Lys Val 
385 ' 390 395 400 

ctg att aaa gaa ctg gag ccg tat cgc ccg ctg ttt att gag gag ccg 1248 
Leu lie Lys Glu Leu Glu Pro Tyr Arg Pro Leu Phe lie Glu Glu Pro 
405 410 415 

gtg ctg gcg gaa caa gee gaa tac tac ccg aaa ctg gcg gca caa acg 1296 
Val Leu Ala Glu Gin Ala Glu Tyr Tyr Pro Lys Leu Ala Ala Gin Thr 
420 425 430 

cat att cca ctg gcg gcg ggt gaa cgc atg ttc tea cgc ttc gat ttt 1344 
His lie Pro Leu Ala Ala Gly Glu Arg Met Phe Ser Arg Phe Asp Phe 
435 440 445 

aaa cgc gtg ctg gag gca ggt ggt att teg att ctg caa ccg gat etc 1392 
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Lys Arg Val Leu Glu Ala Gly Gly lie Ser He Leu Gin Pro Asp Leu 
450 455 460 

tec cac gcg ggc ggt att acc gaa tgc tac aaa ate gec gga atg gca 1440 
Ser His Ala Gly Gly He Thr Glu Cys Tyr Lys He Ala Gly Met Ala 
465 470 475 480 

gaa gee tat gac gtg acc ctt gcg ccg cac tgt ccg etc gga ccg att 1488 
Glu Ala Tyr Asp Val Thr Leu Ala Pro His Cys Pro Leu Gly Pro He 
485 490 495 

gca ctg gcg get tgc ctg cat ate gac ttt gtt tec tat aac gee gta 1536 
Ala Leu Ala Ala Cys Leu His He Asp Phe Val Ser Tyr Asn Ala Val 
500 505 ^ 510 

ctt cag gaa caa agt atg gga att cat tac aac aaa ggc gcg gag tta 1584 
Leu Gin Glu Gin Ser Met Gly He His Tyr Asn Lys Gly Ala Glu Leu 
515 520 525 

etc gac ttt gtg aaa aac aaa gaa gac ttc age atg gtc ggc ggc ttc 1632 
Leu Asp Phe Val Lys Asn Lys Glu Asp Phe Ser Met Val Gly Gly Phe 
530 535 540 

ttt aaa ccg tta acg aaa ccg ggc tta ggc gtg gaa ate gac gaa get 1680 
Phe Lys Pro Leu Thr Lys Pro Gly Leu Gly Val Glu He Asp Glu Ala 
545 550 555 560 

aaa gtg att gag ttc agt aaa aat gee ccg gac tgg cgt aat ccg etc 1728 
Lys Val He Glu Phe Ser Lys Asn Ala Pro Asp Trp Arg Asn Pro Leu 
565 570 575 

tgg cgt cat gaa gat aac age gta gca gag tgg taa 1764 
Trp Arg His Glu Asp Asn Ser Val Ala Glu Trp * 
580 585 



<210> 165 
<211> 879 
<212> DNA 

<213> Escherichia coli 

<220> 

<221> CDS 

<222> (1) . . . (879) 

<400> 165 

atg aca get cgc tac ate gca att gac tgg gga teg acc aat ctg cgc 4 8 

Met Thr Ala Arg Tyr He Ala He Asp Trp Gly Ser Thr Asn Leu Arg 
15 10 15 

gee tgg ctt tat cag ggc gac cac tgc ctg gag age agg caa tea gaa 96 
Ala Trp Leu Tyr Gin Gly Asp His Cys Leu Glu Ser Arg Gin Ser Glu 
20 25 30 

gca ggc gtc acg cgc ctg aac gga aaa tct ccg get gcg gtg tta gca 144 
Ala Gly Val Thr Arg Leu Asn Gly Lys Ser Pro Ala Ala Val Leu Ala 
35 " 40 45 

gaa gtc acg acc gac tgg cgt gaa gag aaa acg cca gtg gta atg gca 192 
Glu Val Thr Thr Asp Trp Arg Glu Glu Lys Thr Pro Val Val Met Ala 
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50 55 60 

gga atg gtt ggc age aac gtc ggc tgg aaa gtt gca ccg tat tta tct 240 
Gly Met Val Gly Ser Asn Val Gly Trp Lys Val Ala Pro Tyr Leu Ser 
65 70 75 80 

gtt cct gec tgt ttt teg tct att ggc gaa caa tta acg tea gtt ggc 288 
Val Pro Ala Cys Phe Ser Ser lie Gly Glu Gin Leu Thr Ser Val Gly 
85 90 95 

gac aat ate tgg att att ccc gga tta tgt gtc tct cat gac gat aac 336 
Asp Asn lie Trp lie lie Pro Gly Leu Cys Val Ser His Asp Asp Asn 
100 105 110 

cac aat gtg atg cgc ggc gaa gaa aca caa ttg ate ggc gcg cga get 384 
His Asn Val Met Arg Gly Glu Glu Thr Gin Leu lie Gly Ala Arg Ala 
115 120 125 

ctg get cct tec tct ctt tat gtc atg ccc gga ace cat tgc aaa tgg 4 32 

Leu Ala Pro Ser Ser Leu Tyr Val Met Pro Gly Thr His Cys Lys Trp 
130 135 140 

gtg cag gee gat age cag caa ate aac gat ttt cgc ace gtg atg ace 4 80 

Val Gin Ala Asp Ser Gin Gin lie Asn Asp Phe Arg Thr Val Mel; Thr 
145 150 155 160 

ggt gaa tta cat cat tta ctg tta aat cac tea ttg att ggc gca ggt 528 
Gly Glu Leu His His Leu Leu Leu Asn His Ser Leu lie Gly Ala Gly 
165 170 175 

ttg ccg ccg cag gaa aac tct gee gat gee ttc aca get ggc ctt gag 576 
Leu Pro Pro Gin Glu Asn Ser Ala Asp Ala Phe Thr Ala Gly Leu Glu 
180 185 190 

cgt ggt ctt aat acg ccc gee ata ttg ccg cag ctt ttt gaa gtt cgc 624 
Arg Gly Leu Asn Thr Pro Ala lie Leu Pro Gin Leu Phe Glu Val Arg 
195 200 205 

gee teg cat gtg ctg gga aca ctt ccc cgc gaa cag gtc age gaa ttt 672 
Ala Ser His Val Leu Gly Thr Leu Pro Arg Glu Gin Val Ser Glu Phe 
210 215 220 

etc tct ggt ttg ttg att ggc gca gag gtc gee agt atg cgc gac tat 720 
Leu Ser Gly Leu Leu lie Gly Ala Glu Val Ala Ser Met Arg Asp Tyr 
225 230 235 240 

gtg gee cat caa cac gec ate acc ctt gtc gee gga aca teg ctg ace 7 68 

Val Ala His Gin His Ala lie Thr Leu Val Ala Gly Thr Ser Leu Thr 
245 250 '255 

gcg cgc tac cag caa gec ttt cag gcg atg ggt tgc gac gtg acg gcg 816 
Ala Arg Tyr Gin Gin Ala Phe Gin Ala Met Gly Cys Asp Val Thr Ala 
260 265 270 

gtg gcg ggc gac acg gca ttt cag get ggt ata agg age ate get cat 864 
Val Ala Gly Asp Thr Ala Phe Gin Ala Gly He Arg Scr He Ala His 
275 280 285 

gca gtg gca aac taa 879 
Ala Val Ala Asn * 



-81- 



WO 01/34810 



PCT/USOO/30950 



290 



<210> 166 








<211> 387 








<212> DNA 








<213> Escherichia coli 








<220> 








<221> CDS 








<222> (1) . . . (387) 








<400> 166 








atg act etc aat aaa acc gat 


cgc 


att 


gtc 


Met Thr Leu Asn Lys Thr Asp 


Arg 


lie 


Val 


1 5 






10 



48 



15 



ate gtt cac ggc aaa tac gtg cca ggc teg ccg ctt ccg get gag gcg 96 
lie Val His Gly Lys Tyr Val Pro Gly Ser Pro Leu Pro Ala Glu Ala 
20 25 30 

gaa etc tgt gag gag ttt gca acc teg cgc aac ate ate cgt gag gtg 144 
Glu Leu Cys Glu Glu Phe Ala Thr Ser Arg Asn lie lie Arg Glu Val 
35 40 45 

ttc cgt teg ctg atg gcg aag egg ctg att gaa atg aaa cgt tat cgc 192 
Phe Arg Ser Leu Met Ala Lys Arg Leu lie Glu Met Lys Arg Tyr Arg 
50 55 60 

ggg gcg ttt gtg gca ccg cgt aac cag tgg aat tac etc gac act gac 240 
Gly Ala Phe Val Ala Pro Arg Asn Gin Trp Asn Tyr Leu Asp Thr Asp 
65 70 75 80 

gta ctg caa tgg gtg ctg gaa aat gac tac gac cca egg ctt ate agt 288 
Val Leu Gin Trp Val Leu Glu Asn Asp Tyr Asp Pro Arg Leu lie Ser 
85 90 95 

gee atg age gaa gtg cga aat ctg gtg gaa ccg gcg att gee cgt tgg 336 
Ala Met Ser Glu Val Arg Asn Leu Val Glu Pro Ala lie Ala Arg Trp 
100 105 110 

gag cag age gcg cga ctt cca gcg ate tgg cgc aga ttg aat egg cgc 384 
Glu Gin Ser Ala Arg Leu Pro Ala lie Trp Arg Arg Leu Asn Arg Arg 
115 120 125 

tga 387 



<210> 167 
<211> 297 
<212> DNA 

<213> Escherichia coli 

<220> 

<221> CDS 

<222> (1) . . . (297) 

<400> 167 
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atg att gcc aac aat cag gac cgc gaa gcg ttt aac gaa gcg gat att 48 
Met lie Ala Asn Asn Gin Asp Arg Glu Ala Phe Asn Glu Ala Asp lie 
1 5 10 15 

cgc tac cac gag gcg gtg ctg cag tcq gtg cat aac ccg gtg tta cag 96 
Arg Tyr His Glu Ala Val Leu Gin Ser Val His Asn Pro Val Leu Gin 
20 25 30 

caa ctt age att gcg ate agt teg ctg cag egg gcg gtt ttt gaa cga 14 4 

Gin Leu Ser lie Ala lie Ser Ser Leu Gin Arg Ala Val Phe Glu Arg 
35 40 45 

acc tgg atg ggc gat gag gcc aac atg ccg caa acg etc cag gaa cat 192 
Thr Trp Met Gly Asp Glu Ala Asn Met Pro Gin Thr Leu Gin. Glu His 
50 55 60 

aag gcg ctg ttc gat gcg ata cga cat cag gac ggc gat gcg gca gag 240 
Lys Ala Leu Phe Asp Ala lie Arg His Gin Asp Gly Asp Ala Ala Glu 
65 70 75 80 

cag gcg gcg ctt acc atg ate gcc age teg aca cga agg tta aag gaa 288 
Gin Ala Ala Leu Thr Met lie Ala Ser Ser Thr Arg Arg Leu Lys Glu 
85 , 90 95 

ate aca tga 297 
He Thr * 



<210> 168 
<211> 357 
<212> DNA 

<213> Escherichia coli 

<220> 

<221> CDS 

<222> (1) . . . (357) 

<400> 168 

atg get cgc gta aaa cgt ggt gtt att gca cgt gca cgt cac aag aaa 48 

Met Ala Arg Val Lys Arg Gly Val He Ala Arg Ala Arg His Lys Lys 

1.5 .10 15 

att ttg aaa caa get aaa ggc tac tac ggt gcg cgt tct cgc gta tac 96 
He Leu Lys Gin Ala Lys Gly Tyr Tyr Gly Ala Arg Ser Arg Val Tyr 
20 25 30 

cgc gtt gcc ttc cag get gtt ate aaa get ggt cag tat get tac cgt 144 
Arg Val Ala Phe Gin Ala Val He Lys Ala Gly Gin Tyr Ala Tyr Arg 
35 40 45 

gac cgt cgt caa cgt aag cgt cag ttc cgt caa ctg tgg att gcg cgt 192 
Asp Arg Arg Gin Arg Lys Arg Gin Phe Arg Gin Leu Trp He Ala Arg 
50 55 60 

ate aac gca gca gca cgt cag aac ggt att tct tac age aaa ttc ate 240 
He Asn Ala Ala Ala Arg Gin Asn Gly He Ser Tyr Ser Lys Phe He 
65 70 75 80 

aat ggc ctg aaa aaa gcc tct gtt gaa ate gac cgt aag ate ctg get 288 
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Asn Gly Leu Lys Lys Ala Ser Val Glu He Asp Arg Lys He Leu Ala 
85 90 95 

gat ate gca gta ttc gac aaa gta gcg ttc acc get ctg gtt gaa aaa 336 

Asp He Ala Val Phe Asp Lys Val Ala Phe Thr Ala Leu Val Glu Lys 
100 105 110 

gcg aaa gca get ctg gca taa 357 

Ala Lys Ala Ala Leu Ala * 
115 



<210> 169 










<211> 198 










<212> DNA 










<213> Escherichia coli 










<220> 










<221> CDS 










<222> (1) . . . (198) 










<400> 169 










atg cca aaa att aag acc gta cgc 


ggt get get 


aag cgc 


ttc 


aaa 


Met Pro Lys He Lys Thr Val Arg 


Gly Ala Ala 


Lys Arg 


Phe 


Lys 


1 5 


10 






15 



48 



acc ggt aaa ggt ggt ttt aag cac aag cac get aac ctg cgt cac att 96 
Thr Gly Lys Gly Gly Phe Lys His Lys His Ala Asn Leu Arg His He 
20 25 30 

ctg acc aaa aaa gcg acc aaa cgt aaa cgt cac ctg cgt ccg aaa gcc 14 4 

Leu Thr Lys Lys Ala Thr Lys Arg Lys Arg His Leu Arg Pro Lys Ala 
35 40 45 

atg gtt tec aaa ggc gat ctg ggc ctg gta ate gcg tgc ctg ccg tac 192 
Met Val Ser Lys Gly Asp Leu Gly Leu Val He Ala Cys Leu Pro Tyr 
50 55 60 

gca taa 198 
Ala * 
65 



<210> 170 
<211> 543 
<212> DNA 

<213> Escherichia coli 

<220> 

<221> CDS 

<222> (1) . . . (543) 

<400> 170 

atg aaa ggc gga aaa cga gtt caa acg gcg cgc cct aac cgt ate aat 48 

Met Lys Gly Gly Lys Arg Val Gin Thr Ala Arg Pro Asn Arg He Asn 
15 10 15 

ggc gaa att cgc gcc cag gaa gtt cgc tta aca ggt ctg gaa ggc gag 96 
Gly Glu He Arg Ala Gin Glu Val Arg Leu Thr Gly Leu Glu Gly Glu 
20 25 30 
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cag ctt ggt att gtg agt ctg aga gaa get ctg gag aaa gca gaa gaa 14 4 

Gin Leu Gly He Val Ser Leu Arg Glu Ala Leu Glu Lys Ala Glu Glu 
35 40 45 

gec gga gta gac tta gtc gag ate age cct aac gee gag ccg ccg gtt 192 
Ala Gly Val Asp Leu Val Glu He Ser Pro Asn Ala Glu Pro Pro Val 
50 4 55 60 

tgt cgt ata atg gat tac ggc aaa ttc etc tat gaa aag age aag tct 240 
Cys Arg He Met Asp Tyr Gly Lys Phe Leu Tyr Glu Lys Ser Lys Ser 
65 70 75 80 

tct aag gaa cag aag aaa aag caa aaa gtt ate cag gtt aag gaa att 288 
Ser Lys Glu Gin Lys Lys Lys Gin Lys Val He Gin Val Lys Glu He 
85 90 95 

aaa ttc cgt cct ggt aca gat gaa ggc gac tat cag gta aaa etc cgc 336 
Lys Phe Arg Pro Gly Thr Asp Glu Gly Asp Tyr Gin Val Lys Leu Arg 
100 105 110 



age 



ctg att cgc ttt etc gaa gag ggt gat aaa gee aaa ate acg ctg 384 
Ser Leu He Arg Phe Leu Glu Glu Gly Asp Lys Ala Lys He Thr Leu 
115 120 125 



cgt ttc cgc ggt cgt gag atg gcg cac cag caa ate ggt atg gaa gtg 4 32 

Arg Phe Arg Gly Arg Glu Met Ala His Gin Gin lie Gly Met Glu Val 
130 135 140 

ctt aat cgc gtg aaa gac gat ttg caa gaa ctg gca gtg gtc gaa tec 480 
Leu Asn Arg Val Lys Asp Asp Leu Gin Glu Leu Ala Val Val Glu Ser 
145 150 155 160 

ttc cca acg aag ate gaa ggc cgc cag atg ate atg gtg etc get cct 528 
Phe Pro Thr Lys He Glu Gly Arg Gin Met He Met Val Leu Ala Pro 
165 170 175 

aag aag aaa cag taa 54 3 

Lys Lys Lys Gin * 
180 



<210> 171 
<211> 1929 
<212> DNA 

<213> Escherichia coli 



<220> 

<221> CDS 

<222> (1) . . . (1929) 



<400> 171 

atg cct gtt ata act ctt cct gat ggc age caa cgc cat tac gat cac 48 
Met Pro Val He Thr Leu Pro Asp Gly Ser Gin Arg His Tyr Asp His 
1 5 10 , 15 

get gta age ccc atg gat gtt gcg ctg gac att ggt cca ggt ctg gcg 96 
Ala Val Ser Pro Met Asp Val Ala Leu Asp He Gly Pro Gly Leu Ala 
20 25 30 
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aaa gcc tgt ate gca ggg cgc gtt aat ggc gaa ctg gtt gat get tgc 14 4 

Lys Ala Cys lie Ala Gly Arg Val Asn Gly Glu Leu Val Asp Ala Cys 

35 40 45 

gat ctg att gaa aac gac gca caa ctg teg ate att ace gcc aaa gac 192 

Asp Leu He Glu Asn Asp Ala Gin Leu Ser He He Thr Ala Lys Asp 
50 55 60 

gaa gaa ggt ctg gag ate att cgt cac tec tgt gcg cac ctg tta ggg 240 

Glu Glu Gly Leu Glu He He Arg His Ser Cys Ala His Leu Leu Gly 
65 70 75 80 

cac gcg att aaa caa ctt tgg ccg cat acc aaa atg gca ate ggc ccg 288 

His Ala He Lys Gin Leu Trp Pro His Thr Lys Met Ala He Gly Pro 
85 90 95 

gtt att gac aac ggt ttt tat tac gac gtt gat ctt gac cgc acg tta 336 

Val He Asp Asn Gly Phe Tyr Tyr Asp Val Asp Leu Asp Arg Thr Leu 
100 105 110 

acc cag gaa gat gtc gaa gca etc gag aag egg atg cat gag ctt get 384 

Thr Gin Glu Asp Val Glu Ala Leu Glu Lys Arg Met His Glu Leu Ala 

115 " 120 125 

gag aaa aac tac gac gtc att aag aag aaa gtc age tgg cac gaa gcg 432 

Glu Lys Asn Tyr Asp Val He Lys Lys Lys Val Ser Trp His Glu Ala 
130 135 140 

cgt gaa act ttc gcc aac cgt ggg gag age tac aaa gtc tec att ctt 4 80 

Arg Glu Thr Phe Ala Asn Arg Gly Glu Ser Tyr Lys Val Ser He Leu 
145 150 155 160 

gac gaa aac ate gcc cat gat gac aag cca ggt ctg tac ttc cat gaa 528 

Asp Glu Asn He Ala His Asp Asp Lys Pro Gly Leu Tyr Phe His Glu 
165 170 175 

gaa tat gtc gat atg tgc cgc ggt ccg cac gta ccg aac atg cgt ttc 576 

Glu Tyr Val Asp Met Cys Arg Gly Pro His Val Pro Asn Met Arg Phe 
180 185 190 

tgc cat cat ttc aaa eta atg aaa acg gca ggg get tac tgg cgt ggc 624 

Cys His His Phe Lys Leu Met Lys Thr Ala Gly Ala Tyr Trp Arg Gly 

195 200 205 

gac age aac aac aaa atg ttg caa cgt att tac ggt acg gcg tgg gca 672 

Asp Ser Asn Asn Lys Met Leu Gin Arg He Tyr Gly Thr Ala Trp Ala 
210 215 220 

gac aaa aaa gca ctt aac get tac ctg cag cgc ctg gaa gaa gcc gcg 720 

Asp Lys Lys Ala Leu Asn Ala Tyr Leu Gin Arg Leu Glu Glu Ala Ala 
225 230 235 240 

aaa cgc gac cac cgt aaa ate ggt aaa cag etc gac ctg tac cat atg 768 

Lys Arg Asp His Arg Lys He Gly Lys Gin Leu Asp Leu Tyr His Met 
245 250 255 

cag gaa gaa gcg ccg ggt atg gta ttc tgg cac aac gac ggc tgg acc 816 

Gin Glu Glu Ala Pro Gly Met Val Phe Trp His Asn Asp Gly Trp Thr 
260 265 270 
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ate ttc cgt gaa ctg gaa gtg ttt gtt cgt tct aaa ctg aaa gag tac 864 
lie Phe Arg Glu Leu Glu Val Phe Val Arg Ser Lys Leu Lys Glu Tyr 
275 280 285 

cag tat cag gaa gtt aaa ggt ccg ttc atg atg gac cgt gtc ctg tgg 912 
Gin Tyr Gin Glu Val Lys Gly Pro Phe Met Met Asp Arg Val Leu Trp 
290 295 300 

gaa aaa acc ggt cac tgg gac aac tac aaa gat gca atg ttc acc aca 960 
Glu Lys Thr Gly His Trp Asp Asn Tyr Lys Asp Ala Met Phe Thr Thr 
305 ~ 310 315 320 

tct tct gag aac cgt gaa tac tgc att aag ccg atg aac tgc ccg ggt 1008 
Ser Ser Glu Asn Arg Glu Tyr Cys lie Lys Pro Met Asn Cys Pro Gly 
325 330 335 

cac gta caa att ttc aac cag ggg ctg aag tct tat cgc gat ctg ccg 1056 
His Val Gin lie Phe Asn Gin Gly Leu Lys Ser Tyr Arg Asp Leu Pro 
340 345 350 



ctg cgt atg gec gag ttt ggt age tgc cac cgt aac gag ccg tea ggt 
Leu Arg Met Ala Glu Phe Gly Ser Cys His Arg Asn Glu Pro Ser Gly 
355 360 365 



1104 



teg ctg cat ggc ctg atg cgc gtg cgt gga ttt acc cag gat gac gcg 1152 
Ser Leu His Gly Leu Met Arg Val Arg Gly Phe Thr Gin Asp Asp Ala 
370 375 380 

cat ate ttc tgt act gaa gaa caa att cgc gat gaa gtt aac gga tgt 1200 
His He Phe Cys Thr Glu Glu Gin He Arg Asp Glu Val Asn Gly Cys 
385 390 395 400 

ate cgt tta gtc tat gat atg tac age act ttt ggc ttc gag aag ate 1248 
lie Arg Leu Val Tyr Asp Met Tyr Ser Thr Phe Gly Phe Glu Lys He 
405 410 415 

gtc gtc aaa etc tec act cgt cct gaa aaa cgt att ggc age gac gaa 1296 
Val Val Lys Leu Ser Thr Arg Pro Glu Lys Arg He Gly Ser Asp Glu 
420 425 430 

atg tgg gat cgt get gag gcg gac ctg gcg gtt gcg ctg gaa gaa aac 1344 
Met Trp Asp Arg Ala Glu Ala Asp Leu Ala Val Ala Leu Glu Glu Asn 
435 440 445 

aac ate ccg ttt gaa tat caa ctg ggt gaa ggc get ttc tac ggt ccg 1392 
Asn He Pro Phe Glu Tyr Gin Leu Gly Glu Gly Ala Phe Tyr Gly Pro 
450 455 460 

aaa att gaa ttt acc ctg tat gac tgc etc gat cgt gca tgg cag tgc 1440 
Lys He Glu Phe Thr Leu Tyr Asp Cys Leu Asp Arg Ala Trp Gin Cys 
465 470 475 480 

ggt aca gta cag ctg gac ttc tct ttg ccg tct cgt ctg age get tct 1488 
Gly Thr Val Gin Leu Asp Phe Ser Leu Pro Ser Arg Leu Ser Ala Ser 
485 ' 490 495 

tat gta ggc gaa gac aat gaa cgt aaa gta ccg gta atg att cac cgc 1536 
Tyr Val Gly Glu Asp Asn Glu Arg Lys Val Pro Val Met He His Arg 
500 505 510 
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gca att ctg ggg teg atg gaa cgt ttc ate ggt ate ctg acc gaa gag 1584 

Ala He Leu Gly Ser Met Glu Arg Phe He Gly He Leu Thr Glu Glu 

515 520 525 

ttc get ggt ttc ttc ccg acc tgg ctt gcg ccg gtt cag gtt gtt ate 1632 

Phe Ala Gly Phe Phe Pro Thr Trp Leu Ala Pro Val Gin Val Val He 

530 535 540 

atg aat att acc gat tea cag tct gaa tac gtt aac gaa ttg acg caa 1680 

Met Asn He Thr Asp Ser Gin Ser Glu Tyr Val Asn Glu Leu Thr Gin 

545 550 555 560 

aaa eta tea aat gcg ggc att cgt gtt aaa gca gac ttg aga aat gag 1728 

Lys Leu Ser Asn Ala Gly He Arg Val Lys Ala Asp Leu Arg Asn Glu 
565 570 575 

aag att ggc ttt aaa ate cgc gag cac act ttg cgt cgc gtc cca tat 177 6 

Lys He Gly Phe Lys He Arg Glu His Thr Leu Arg Arg Val Pro Tyr 
580 585 590 

atg ctg gtc tgt ggt gat aaa gag gtg gaa tea ggc aaa gtt gee gtt 1824 

Met Leu Val Cys Gly Asp Lys Glu Val Glu Ser Gly Lys Val Ala Val 

595 600 605 

cgc acc cgc cgt ggt aaa gac ctg gga age atg gac gta aat gaa gtg 1872 

Arg Thr Arg Arg Gly Lys Asp Leu Gly Ser Met Asp Val Asn Glu Val 

610 " ~ 615 620 

ate gag aag ctg caa caa gag att cgc age cgc agt ctt aaa caa ttg 1920 

He Glu Lys Leu Gin Gin Glu lie Arg Ser Arg Ser Leu Lys Gin Leu 

625 " 630 635 640 

gag gaa taa 1929 
Glu Glu * 



<210> 172 
<211> 993 
<212> DNA 

<213> Escherichia coli 

<220> 

<221> CDS 

<222> (1) . . . (993) 

<400> 172 

atg aaa ata aag aac att eta etc acc ctt tgc acc tea etc ctg ctt 48 

Met Lys He Lys Asn He Leu Leu Thr Leu Cys Thr Ser Leu Leu Leu 
1 5 10 15 

acc aac gtt get gca cac gee aaa gaa gtc aaa ata ggt atg gcg att 96 
Thr Asn Val Ala Ala His Ala Lys Glu Val Lys He Gly Met Ala He 
20 25 30 

gat gat etc cgt ctt gaa cgc tgg caa aaa gat cga gat ate ttt gtg 144 
Asp Asp Leu Arg Leu Glu Arg Trp Gin Lys Asp Arg Asp He Phe Val 
35 40 45 

aaa aag gca gaa tct etc ggc gcg aaa gta ttt gta cag tct gca aat 192 
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Lys Lys Ala Glu Ser Leu Gly Ala Lys Val Phe Val Gin Ser Ala Asn 
50 55 60 

ggc aat gaa gaa aca caa atg teg cag att gaa aac atg ata aac egg 240 
Gly Asn Glu Glu Thr Gin Met Ser Gin lie Glu Asn Met He Asn Arg 
65 70 75 80 

ggt gtc gat gtt ctt gtc att att ccg tat aac ggt cag gta tta agt 288 
Gly Val Asp Val Leu Val lie He Pro Tyr Asn Gly Gin Val Leu Ser 
85 90 95 

aac gtt gta aaa gaa gec aaa caa gaa ggc att aaa gta tta get tac 336 
Asn Val Val Lys Glu Ala Lys Gin Glu Gly He Lys Val Leu Ala Tyr 
100 105 110 

gac cgt atg att aac gat gcg gat ate gat ttt tat att tct ttc gat 384 
Asp Arg Met lie Asn Asp Ala Asp He Asp Phe Tyr He Ser Phe Asp 
115 120 125 

aac gaa aaa gtc ggt gaa ctg cag gca aaa gec ctg gtc gat att gtt 4 32 

Asn Glu Lys Val Gly Glu Leu Gin Ala Lys Ala Leu Val Asp He. Val 
130 135 .140 

ccg caa ggt aat tac ttc ctg atg ggc ggc teg ccg gta gat aac aac 480 
Pro Gin Gly Asn Tyr Phe Leu Met Gly Gly Ser Pro Val Asp Asn Asn 
145 150 155 160 

gec aag ctg ttc cgc gee gga caa atg aaa gtg tta aaa cct tac gtt 528 
Ala Lys Leu Phe Arg Ala Gly Gin Met Lys Val Leu Lys Pro Tyr Val 
165 170 175 

gat tec gga aaa att aaa gtc gtt ggt gac caa tgg gtt gat ggc tgg 57 6 

Asp Ser Gly Lys He Lys Val Val Gly Asp Gin Trp Val Asp Gly Trp 
180 185 190 

tta ccg gaa aac gca ttg aaa att atg gaa aac gcg eta acc gec aat 624 
Leu Pro Glu Asn Ala Leu Lys He Met Glu Asn Ala Leu Thr Ala Asn 
195 200 205 

aat aac aaa att gat get gta gtt gec tea aac gat gee acc gca ggt 672 
Asn Asn Lys He Asp Ala Val Val Ala Ser Asn Asp Ala Thr Ala Gly 
210 215 220 

ggg gca att cag gca tta age gcg caa ggt tta tea ggg aaa gta gca 720 
Gly Ala He Gin Ala Leu Ser Ala Gin Gly Leu Ser Gly Lys Val Ala 
225 230 235 240 

ate tec ggc cag gat gcg gat etc gca ggt aLt aaa cgL att get gee 768 
He Ser Gly Gin Asp Ala Asp Leu Ala Gly He Lys Arg He Ala Ala 
245 250 255 

ggt acg caa act atg acg gtg tat aaa cct att acg ttg ttg gca aat 816 
Gly Thr Gin Thr Met Thr Val Tyr Lys Pro He Thr Leu Leu Ala Asn 
260 ^ 265 270 

act gee gca gaa att gee gtt gag ttg ggc aat ggt cag gaa cca aaa 864 
Thr Ala Ala Glu He Ala Val Glu Leu Gly Asn Gly Gin Glu Pro Lys 
275 280 285 

gca gat acc aca ctg aat aat ggc ctg aaa gat gtc ccc tec cgc etc 912 
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Ala Asp Thr Thr Leu Asn Asn Gly Leu Lys Asp Val Pro Ser Arg Leu 

290 295 300 

ctg aca ccg ate gat gtg aat aaa aac aac ate aaa gat acg gta att 960 

Leu Thr Pro He Asp Val Asn Lys Asn Asn He Lys Asp Thr Val He 

305 310 315 320 

aaa gac gga ttc cac aaa gag age gag ctg taa 993 

Lys Asp Gly Phe His Lys Glu Ser Glu Leu * 
325 330 



<210> 173 
<211> 168 
<212> DNA 

<213> Escherichia coli 

<220> 

<221> CDS 

<222> (1) . . . (168) 

<400> 173 

atg aac aaa ttt att aaa gtt gca ctg gta ggt gca gta ctg get acg 48 

Met Asn Lys Phe He Lys Val Ala Leu Val Gly Ala Val Leu Ala Thr 

1 5 10 15 

tta act gca tgt act ggt cat att gaa aac cgt gat aag aac tgc tct 96 
Leu Thr Ala Cys Thr Gly His He Glu Asn Arg Asp Lys Asn Cys Ser 
20 " 25 - 

tac gac tac ctg ctg cac ccg gca att tct att tct aaa ate att ggc 144 
Tyr Asp Tyr Leu Leu His Pro Ala He Ser He Ser Lys He He Gly 
35 40 45 

ggt tgc ggt cct act gca cag taa 168 
Gly Cys Gly Pro Thr Ala Gin * 
50 55 



<210> 174 
<211> 1389 
<212> DNA 

<213> Escherichia coli 

<220> 

<221> CDS 

<222> (1) . . . (1389) 

<400> 174 

atg act gca aac tct ccc eta caa cgt att gga caa gaa aaa ggt ate 48 

Met Thr Ala Asn Ser Pro Leu Gin Arg He Gly Gin Glu Lys Gly He 
15 10 15 

get atg gga age cag gaa etc caa cgc aag etc gga ttt tgg gee gtt 96 
Ala Met Gly Ser Gin Glu Leu Gin Arg Lys Leu Gly Phe Trp Ala Val 
20 25 30 

ctt gca ate gee gtc ggg aca ace gtc ggc tec ggt att ttt gta tct 144 
Leu Ala He Ala Val Gly Thr Thr Val Gly Ser Gly He Phe Val Ser 
35 40 45 
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gtg ggt gaa gtg gca aaa gca gcg ggc acg ccg tgg ctt acg gtg etc 192 
Val Gly Glu Val Ala Lys Ala Ala Gly Thr Pro Trp Leu Thr Val Leu 
50 55 60 

gcg ttt gtc att ggc ggg tta att gtg ate ccg caa atg tgc gtc tat 240 
Ala Phe Val He Gly Gly Leu He Val He Pro Gin Met Cys Val Tyr 
65 70 75 80 

gcg gaa eta tec ace get tat ccg gaa aat ggc gca gat tat gtt tat 2B8 
Ala Glu Leu Ser Thr Ala Tyr Pro Glu Asn Gly Ala Asp Tyr Val Tyr 
85 90 95 

ctg aaa aat gee gga age cga ccg ctg get ttc etc tec ggc tgg gee 336 
Leu Lys Asn Ala Gly Ser Arg Pro Leu Ala Phe Leu Ser Gly Trp Ala 
100 105 110 

age ttc tgg gec aac gat gcg ccg tea ttg teg att atg gcg ctg gcg 384 
Ser Phe Trp Ala Asn Asp Ala Pro Ser Leu Ser He Met Ala Leu Ala 
115 120 125 

att gtc age aat ctt ggc ttt tta acg cct ate gat ccg ttg etc ggt 432 
He Val Ser Asn Leu Gly Phe Leu Thr Pro He Asp Pro Leu Leu Gly 
130 135 140 

aaa ttt ate gee gee gga tta att ate. gee ttt atg ttg eta cac ctg 480 
Lys Phe He Ala Ala Gly Leu He He Ala Phe Met Leu Leu His Leu 
145 150 155 160 

cgc tec gtt gaa ggc ggc gca gcg ttt cag acg eta att acc ate gee 528 
Arg Ser Val Glu Gly Gly Ala Ala Phe Gin Thr Leu He Thr He Ala 
165 170 175 

aaa att ate ccg ttc act ate gtc att ggc ctt ggg ate ttc tgg ttt 576 
Lys He He Pro Phe Thr He Val He Gly Leu Gly He Phe Trp Phe 
180 185 190 

aaa gcg gag aat ttt gee gec cct acc acc act gcg att ggc gca acg 624 
Lys Ala Glu Asn Phe Ala Ala Pro Thr Thr Thr Ala He Gly Ala Thr 
195 200 205 

ggc age ttt atg gcg ctg ctg gcg ggg ate tct gee acc agt tgg teg 672 
Gly Ser Phe Met Ala Leu Leu Ala Gly lie Ser Ala Thr Ser Trp Ser 
210 215 220 

tat acc ggc atg gee tct ate tgt tat atg acc ggc gaa att aaa aac 720 
Tyr Thr Gly Met Ala Ser He Cys Tyr Met Thr Gly Glu He Lys Asn 
225 230 235 240 

ccc gga aaa acc atg cca cga gcg ctg att ggt tec tgt ctg ctg gtt 768 
Pro Gly Lys Thr Met Pro Arg Ala Leu He Gly Ser Cys Leu Leu Val 
245 250 255 

ctg gtg etc tac acc ctg ctg gcg ctg gtg att tec ggc ctg atg ccc 816 
Leu Val Leu Tyr Thr Leu Leu Ala Leu Val lie Ser Gly Leu Met Pro 
260 265 270 

ttc gac aaa etc gec aat tct gaa acg ccg att tec gac gec ctg acc 864 
Phe Asp Lys Leu Ala Asn Ser Glu Thr Pro He Ser Asp Ala Leu Thr 
275 280 285 
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tgg ate ccc gca etc ggc age acc get ggg ate ttt gtt gcc ate acg 912 
Trp He Pro Ala Leu Gly Ser Thr Ala Gly He Phe Val Ala He Thr 
290 295 300 

gcg atg ate gtc att ctt ggt teg ctt tec age tge gtg atg tac cag 960 
Ala Met He Val He Leu Gly Ser Leu Ser Ser Cys Val Met Tyr Gin 
305 310 315 320 

ccg egg ctg gaa tac gcg atg gcg aaa gac aac ctg ttc ttt aaa tge 1008 
Pro Arg Leu Glu Tyr Ala Met Ala Lys Asp Asn Leu Phe Phe Lys Cys 
325 330 335 

ttc ggc cat gtg cat ccg aaa tac aac acg ccg gat gtc tec ate ate 1056 
Phe Gly His Val His Pro Lys Tyr Asn Thr Pro Asp Val Ser He He 
340 345 350 

ctg caa ggg gcg ctg ggg ate ttc ttc ate ttc gtt tec gat etc acc 1104 
Leu Gin Gly Ala Leu Gly He Phe Phe He Phe Val Ser Asp Leu Thr 
355 360 365 

age ctg ctg ggt tat ttc acc ctg gtg atg tgt ttc aaa aat acc etc 1152 
Ser Leu Leu Gly Tyr Phe Thr Leu Val Met Cys Phe Lys Asn Thr Leu 
370 375 380 

acc ttc ggc tec ate ate tgg tgt cgt aaa cgc gac gat tac aaa ccg 1200 
Thr Phe Gly Ser He He Trp Cys Arg Lys Arg Asp Asp Tyr Lys Pro 
385 390 395 400 

ctg tgg cgt act ccg get ttc ggg ctg atg acc acc etc gcc att gcg 1248 
Leu Trp Arg Thr Pro Ala Phe Gly Leu Met Thr Thr Leu Ala He Ala 
405 410 415 

tea age etc att ctg gtc gcc tea acc ttt gtc tgg gca ccg att ccc 1296 
Ser Ser Leu He Leu Val Ala Ser Thr Phe Val Trp Ala Pro He Pro 
420 425 430 

ggc ctt ate tge gcc gtc ate gtt att get act ggt ctg cct get tac 1344 
Gly Leu He Cys Ala Val He Val He Ala Thr Gly Leu Pro Ala Tyr 
435 440 445 

gcc ttc tgg gcg aag cgt age cgc cag etc aac get ttg teg taa 1389 
Ala Phe Trp Ala Lys Arg Ser Arg Gin Leu Asn Ala Leu Ser * 
450 455 460 



<210> 175 
<211> 1044 
<212> DNA 

<213> Escherichia coli 

<220> 
<221> CDS 

<222> (1) . . . (1044) 
<400> 175 

atg tta tct gga gaa aat aaa atg ttg gat att gat aaa age acc gtg 48 
Met Leu Ser Gly Glu Asn Lys Met Leu Asp He Asp Lys Ser Thr Val 
15 10 15 
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gac ttt ctg gtc acc gaa aat atg gtt cag gaa gtg gaa aaa gtt etc 96 

Asp Phe Leu Val Thr Glu Asn Met Val Gin Glu Val Glu Lys Val Leu 

20 25 30 

age cat gac gtt ccg ctg gtg cac gec ate gtg gaa gag atg gtg aag 144 

Ser His Asp Val Pro Leu Val His Ala He Val Glu Glu Met Val Lys 

35 40 45 

cgc gac att gat cgt att tat ttc gtt gee tgc gga teg cca etc aac 192 

Arg Asp He Asp Arg He Tyr Phe Val Ala Cys Gly Ser Pro Leu Asn 

50 55 60 

gcg gcg caa acg gcg aaa cat ctg gcg gat cgc ttt tec gat ctt cag 240 

Ala Ala Gin Thr Ala Lys His Leu Ala Asp Arg Phe Ser Asp Leu Gin 

65 70 75 80 

gtc tac gec att tec ggc tgg gag ttc tgc gat aac acc ccg tat cgc 288 

Val Tyr Ala He Ser Gly Trp Glu Phe Cys Asp Asn Thr Pro Tyr Arg 

85 90 95 

etc gac gat cgt tgc gca gta att ggc gtt tct gac tac ggt aaa acc 336 

Leu Asp Asp Arg Cys Ala Val He Gly Val Ser Asp Tyr Gly Lys Thr 

100 * 105 110 

gaa gag gta ate aaa gcg ctg gag ctg ggc egg gec tgc ggc gca etc 384 

Glu Glu Val He Lys Ala Leu Glu Leu Gly Arg Ala Cys Gly Ala Leu 

115 120 125 

act gcg gcg ttc acc aaa cgc gcg gat age ccg att acc teg gcg gcg 432 

Thr Ala Ala Phe Thr Lys Arg Ala Asp Ser Pro He Thr Ser Ala Ala 

130 ~ 135 140 

gaa ttt age att gat tat cag gee gac tgt ate tgg gaa att cac ctg 480 

Glu Phe Ser He Asp Tyr Gin Ala Asp Cys He Trp Glu He His Leu 

145 150 155 160 

ctg etc tgc tac age gtg gtg ctg gag atg ate acc cgc etc gcg ccg 528 

Leu Leu Cys Tyr Ser Val Val Leu Glu Met He Thr Arg Leu Ala Pro 

165 170 175 

aac gcg gaa ate ggc aag ate aaa aac gat etc aag cag ttg ccg aat 57 6 
Asn Ala Glu He Gly Lys He Lys Asn Asp Leu Lys Gin Leu Pro Asn 

180 185 190 

gcg etc ggt cat ctg gta cgc acc tgg gaa gaa aaa ggc cgc cag ctt 624 

Ala Leu Gly His Leu Val Arg Thr Trp Glu Glu Lys Gly Arg Gin Leu 

195 200 205 

ggt gaa ctg gec age cag tgg ccg atg att tat acc gtt get gcg ggt 672 
Gly Glu Leu Ala Ser Gin Trp Pro Met He Tyr Thr Val Ala Ala Gly 

210 215 220 

ccg ctg cgt ccg ctg ggt tac aaa gaa ggc att gta acg ctg atg gaa 720 
Pro Leu Arg Pro Leu Gly Tyr Lys Glu Gly He Val Thr Leu Met Glu 

225 230 235 240 

ttt acc tgg acg cac ggc tgc gtg att gag age gga gag ttc cgc cat 768 

Phe Thr Trp Thr His Gly Cys Val He Glu Ser Gly Glu Phe Arg His 

245 250 255 
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ggc ccg ctg gag att gtc gaa ccg ggc gtt ccg ttc ctg ttc ctg etc 816 
Gly Pro Leu Glu lie Val Glu Pro Gly Val Pro Phe Leu Phe Leu Leu 
260 265 270 

ggc aat gat gaa agt cgc cac acc acc gaa cgc gec att aac ttt gtt 8 64 

Gly Asn Asp Glu Ser Arg His Thr Thr Glu Arg Ala lie Asn Phe Val 
275 280 285 

aaa cag cgt act gac aac gtg ate gtc ate gat tac gee gaa att teg 912 
Lys Gin Arg Thr Asp Asn Val lie Val lie Asp Tyr Ala Glu lie Ser 
290 295 300 

caa ggg ctg cac ccg tgg ctg gca ccg ttc ctg atg ttc gtg cca atg 960 
Gin Gly Leu His Pro Trp Leu Ala Pro Phe Leu Met Phe Val Pro Met 
305 310 315 320 

gag tgg etc tgc tac tac ctg tct att tac aaa gat cac aac ccg gat 1008 
Glu Trp Leu Cys Tyr Tyr Leu Ser lie Tyr Lys Asp His Asn Pro Asp 
325 330 335 

gaa cgc cgc tat tac ggt ggt ctg gtg gaa tat taa 104 4 

Glu Arg Arg Tyr Tyr Gly Gly Leu Val Glu Tyr * 
340 ^ 345 



<210> 176 
<211> 450 
<212> DNA 

<213> Escherichia coli 

<220> 

<221> CDS 

<222> (1) . . . (450) 

<400> 176 

atg aac gee get att acg gtg gtc tgg tgg aat att aat ccc tct ccc 48 

Met Asn Ala Ala He Thr Val Val Trp Trp Asn He Asn Pro Ser Pro 
15 10 15 

ggc ccg gca acg ggc egg att tac gca agg agt tac ccg atg aaa aca 96 
Gly Pro Ala Thr Gly Arg He Tyr Ala Arg Ser Tyr Pro Met Lys Thr 
20 25 30 

ggt atg ttt acc tgc ggc cac cag egg ctg cct att gaa cac gca ttt 144 
Gly Met Phe Thr Cys Gly His Gin Arg Leu Pro He Glu His Ala Phe 
35 40 45 

cgt gat gca age gag ctg ggt tac gac ggc ate gaa att tgg ggc ggt 192 
Arg Asp Ala Ser Glu Leu Gly Tyr Asp Gly He Glu He Trp Gly Gly 
50 55 60 

cgc ccg cac gcg ttc gcg ccg gac tta aaa gcg ggc ggc ate aaa caa 240 
Arg Pro His Ala Phe Ala Pro Asp Leu Lys Ala Gly Gly He Lys Gin 
65 70 75 80 

ate aag gcg ctg gcg cag acg tat cag atg ccg att ate ggc tat acg 288 
He Lys Ala Leu Ala Gin Thr Tyr Gin Met Pro He lie Gly Tyr Thr 
85 90 95 

cca gaa acc aac ggc tat ccg tat aac atg atg ctg ggc gat gaa cat 336 
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Pro Glu Thr Asn Gly Tyr 
100 

atg cgt cgc gaa age etc 
Met Arg Arg Glu Ser Leu 
115 

aaa gag atg aac gcg ggt 
Lys Glu Met Asn Ala Gly 
130 

eta tct cac gee ace taa 
Leu Ser His Ala Thr * 
145 



Pro Tyr Asn Met Met Leu 
105 

gac atg ate aag ctg gcg 
Asp Met lie Lys Leu Ala 
120 

tat acg ctg att tec gee 
Tyr Thr Leu lie Ser Ala 
135 140 



Gly Asp Glu His 
110 

atg gat atg gca 384 

Met Asp Met Ala 

125 

ggc cca cgc ggg 4 32 

Gly Pro Arg Gly 



450 



<210> 177 

<211> 384 ... 
<212> DNA 

<213> Escherichia coli 

<220> - . 

<221> CDS 

<222> (1) . . . (384) 

<400> 177 

atg cgt gat att cag atg gtt ctt gag cgt tgg gga gcg tgg gcg get 48 
Met Arg Asp He Gin Met Val Leu Glu Arg Trp Gly Ala Trp Ala Ala 
15 .10 15 

aat aat cat gaa gat gtg acc tgg teg tec att gec gee ggt ttt aag 96 
Asn Asn His Glu Asp Val Thr Trp Ser Ser He Ala Ala Gly Phe Lys 
20 25 30 

gga tta att act tea aaa gta aaa tct cgc ccg caa tgt tgt gac gat 144 
Gly Leu He Thr Ser Lys Val Lys Ser Arg Pro Gin Cys Cys Asp Asp 
35 40 45 

gac gcg atg ate att tgc ggg tgc atg gee cgt ctg aaa aag aac aac 192 
Asp Ala Met He lie Cys Gly Cys Met Ala Arg Leu Lys Lys Asn Asn 
50 55 60 

age gat ttg cac gat tta tta gta gat tat tat gta gtc ggt atg aca 240 
Ser Asp Leu His Asp Leu Leu Val Asp Tyr Tyr Val Val Gly Met Thr 
65 70 75 80 

ttc atg tea ctg gca ggt aag cat tgc tgc tct gat ggt tat ate ggg 288 
Phe Met Ser Leu Ala Gly Lys His Cys Cys Ser Asp Gly Tyr He Gly 
85 90 95 

aaa agg tta cag aag get gag ggc ata att gaa ggg atg tta atg gca 336 
Lys Arg Leu Gin Lys Ala Glu Gly He lie Glu Gly Met Leu Met Ala 
100 ~ 105 110 

tta gat ate egg tta gag atg gat ate gtt gtt aat aac tct aat taa 384 
Leu Asp He Arg Leu Glu Met Asp He Val Val Asn Asn Ser Asn * 
115 120 125 



<210> 178 
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<211> 1362 
<212> DNA 

<213> Escherichia coli 

<220> 

<221> CDS 

<222> (1)...(1362) 

<400> 178 

atg ttt gat aat tta acc gat cgt ttg teg cgc acg ctg cgc aat ate 48 

Met Phe Asp Asn Leu Thr Asp Arg Leu Ser Arg Thr Leu Arg Asn lie 
15 10 15 

agt ggc cgt gga cgc etc act gaa gac aac gta aaa gat acg ctg cgc 96 
Ser Gly Arg Gly Arg Leu Thr Glu Asp Asn Val Lys Asp Thr Leu Arg 
20 25 30 

gaa gtg cgc atg gcg ctg ctg gag gcg gac gta get ctg ccg gta gtg 144 
Glu Val Arg Met Ala Leu Leu Glu Ala Asp Val Ala Leu Pro Val Val 
35 40 45 

cgt gag ttt ate aat cgc gta aaa gag aaa gcg gtt ggt cat gaa gtt 192 
Arg Glu Phe He Asn Arg Val Lys Glu Lys Ala Val Gly His Glu Val 
50 55 60 . 

aat aag age ctg acg ccg ggg cag gag ttc gtc aaa ata gtc cgt aac 240 
Asn Lys Ser Leu Thr Pro Gly Gin Glu Phe Val Lys He Val Arg Asn 
65 70 75 80 

gaa ctg gtt gcg gcg atg ggc gaa gag aac cag acc ctg aac ctg get 288 
Glu Leu Val Ala Ala Met Gly Glu Glu Asn Gin Thr Leu Asn Leu Ala 
85 90 95 

gcg caa ccg cct gcg gtc gta ctg atg gcg ggc ctg caa ggt gee ggt 336 
Ala Gin Pro Pro Ala Val Val Leu Met Ala Gly Leu Gin Gly Ala Gly 
100 105 110 

aaa aca acc age gtt ggt aaa etc ggt aag ttc ctg cgc gag aag cac 384 
Lys Thr Thr Ser Val Gly Lys Leu Gly Lys Phe Leu Arg Glu Lys His 
115 120 125 

aag aag aaa gtg ctg gtg gtt tct gee gac gtt tat cgc ccg gcg gca 432 
Lys Lys Lys Val Leu Val Val Ser Ala Asp Val Tyr Arg Pro Ala Ala 
130 * 135 140 

ate aaa cag ctt gag acg ctg gca gag cag gtg ggc gtt gat ttc ttc 480 
He Lys Gin Leu Glu Thr Leu Ala Glu Gin Val Gly Val Asp Phe Phe 
145 150 155 160 

cct tct gat gtt ggt cag aag ccg gta gat ate gtt aac gcg gcg ctg 528 
Pro Ser Asp Val Gly Gin Lys Pro Val Asp lie Val Asn Ala Ala Leu 
165 170 175 

aaa gaa gee aaa ctg aaa ttc tac gac gtg ctg ctg gtg gat acc get 57 6 

Lys Glu Ala Lys Leu Lys Phe Tyr Asp Val Leu Leu Val Asp Thr Ala 
180 185 190 

ggt cgt ctg cac gtt gac gaa gcg atg atg gac gag ate aaa caa gtc 624 
Gly Arg Leu His Val Asp Glu Ala Met Met Asp Glu He Lys Gin Val 
195 200 205 
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cat gcg teg att aac ccg gtt gaa acc ctg ttt gtg gtt gac gec atg 672 
His Ala Ser lie Asn Pro Val Glu Thr Leu Phe Val Val Asp Ala Met 
210 215 220 



acc ggt cag gat gcg gec aat acg gca aaa gca ttc aat gaa gcg tta 
Thr Gly Gin Asp Ala Ala Asn Thr Ala Lys Ala Phe Asn Glu Ala Leu 
225 * 230 235 240 



aag ctg ccg ggc atg ggg cag ate ccg gat aac gtc aag tea cag atg 
Lys Leu Pro Gly Met Gly Gin lie Pro Asp Asn Val Lys Ser Gin Met 
355 360 365 



720 



ccg ctt acc ggc gta gtg ttg acc aaa gtg gac ggc gat gec cgc ggc 768 
Pro Leu Thr Gly Val Val Leu Thr Lys Val Asp Gly Asp Ala Arg Gly 
245 250 255 

ggt gcg gcg etc tct att cgt cac ate act ggc aaa ccg ate aag ttc 816 
Gly Ala Ala Leu Ser lie Arg His lie Thr Gly Lys Pro He Lys Phe 
260 " 265 270 

etc ggt gtt ggc gag aaa act gag gcg ctg gag ccg ttc cat ccg gac 864 
Leu Gly Val Gly Glu Lys Thr Glu Ala Leu Glu Pro Phe His Pro Asp 
275 280 285 

cgc ate gcg teg cgt att etc ggc atg ggc gac gta ctg teg ctg ate 912 
Arg He Ala Ser Arg He Leu Gly Met Gly Asp Val Leu Ser Leu He 
290 295 300 

gaa gat ate gaa age aaa gtt gac cgc gcg cag gca gag aaa tta gee 960 
Glu Asp He Glu Ser Lys Val Asp Arg Ala Gin Ala Glu Lys Leu Ala 
305 310 315 320 

age aag ctg aaa aaa ggt gac ggc ttc gat etc aac gac ttt ctt gag 1008 
Ser Lys Leu Lys Lys Gly Asp Gly Phe Asp Leu Asn Asp Phe Leu Glu 
325 * 330 335 

cag ctg cgc cag atg aaa aat atg ggc ggc atg get agt ctg atg ggc 1056 
Gin Leu Arg Gin Met Lys Asn Met Gly Gly Met Ala Scr Leu Met Gly 
340 345 350 



1104 



gac gat aaa gtg ctg gtg cgt atg gaa gee ate ate aac teg atg acg 1152 

Asp Asp Lys Val Leu Val Arg Met Glu Ala He He Asn Ser Met Thr 

370 375 380 

atg aaa gag cgc get aag cca gaa ate ate aaa ggt teg cgt aaa cgc 1200 

Met Lys Glu Arg Ala Lys Pro Glu He He Lys Gly Ser Arg Lys Arg 

385 390 395 400 

cgt att get gec ggt tgc ggt atg cag gtg cag gac gtt aac cgt ctt 1248 

Arg He Ala Ala Gly Cys Gly Met Gin Val Gin Asp Val Asn Arg Leu 
405 410 415 

ctg aaa cag ttc gac gac atg cag cgc atg atg aag aaa atg aag aag 1296 

Leu Lys Gin Phe Asp Asp Met Gin Arg Met Met Lys Lys Met Lys Lys 
420 425 430 

ggc gga atg gcg aag atg atg aga age atg aag ggt atg atg ccc cca 1344 

Gly Gly Met Ala Lys Met Met Arg Ser Met Lys Gly Met Met Pro Pro 

435 440 445 
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ggc ttc cct ggt cgc taa 1362 
Gly Phe Pro Gly Arg * 
450 



<210> 179 
<211> 1734 
<212> DNA 

<213> Escherichia coli 

<220> 

<221> CDS 

<222> (1) . . . (1734) 

<400> 179 

gtg aaa caa cag ata caa ctt cgt cgc cgt gaa gtc gat gaa acg gca 

Met Lys Gin Gin lie Gin Leu Arg Arg Arg Glu Val Asp Glu Thr Ala 

15 10 15 



ctg ccc tgg cag caa ctg age ggc gtc gaa aag gec gtt gag ate ctt 
Leu Pro Trp Gin Gin Leu Ser Gly Val Glu Lys Ala Val Glu lie Leu 
50 55 60 



48 



gac ttg ccc get gaa ttg cct ccc ttg ctg cgc cgt tta tac gec age 96 

Asp Leu Pro Ala Glu Leu Pro Pro Leu Leu Arg Arg Leu Tyr Ala Ser 

20 25 30 

egg gga gta cgc agt gcg caa gaa ctg gaa cgc agt gtt aaa ggt atg 14 4 

Arg Gly Val Arg Ser Ala Gin Glu Leu Glu Arg Ser Val Lys Gly Met 

35 40 45 



192 



tac aac get ttt cgc gaa gga acg egg att att gtg gtc ggt gat ttc 240 

Tyr Asn Ala Phe Arg Glu Gly Thr Arg lie He Val Val Gly Asp Phe 

65 70 75 80 

gac gee gac ggc gcg acc age acg get eta age gtg ctg gcg atg cgc 288 

Asp Ala Asp Gly Ala Thr Ser Thr Ala Leu Ser Val Leu Ala Met Arg 

85 90 95 

teg ctt ggt tgc age aat ate gac tac ctg gta cca aac cgt ttc gaa 336 

Ser Leu Gly Cys Ser Asn He Asp Tyr Leu Val Pro Asn Arg Phe Glu 

100 105 110 

gac ggt tac ggc tta age ccg gaa gtg gtc gat cag gec cat gee cgt 384 

Asp Gly Tyr Gly Leu Ser Pro Glu Val Val Asp Gin Ala His Ala Arg 
115 120 125 

ggc gcg cag tta att gtc acg gtg gat aac ggt att tec tec cat gcg 432 

Gly Ala Gin Leu He Val Thr Val Asp Asn Gly He Ser Ser His Ala 

130 135 140 

ggg gtt gag cac get cgc teg ttg ggc ate ccg gtt att gtt acc gat 480 

Gly Val Glu His Ala Arg Ser Leu Gly He Pro Val lie Val Thr Asp 

145 150 155 160 

cac cat ttg cca ggc gac aca tta ccc gca gcg gaa gcg ate att aac 528 

His His Leu Pro Gly Asp Thr Leu Pro Ala Ala Glu Ala He He Asn 

165 * 170 175 
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cct aac ttg cgc gac tgt aat ttc ccg teg aaa tea ctg gca ggc gtg 576 
Pro Asn Leu Arg Asp Cys Asn Phe Pro Ser Lys Ser Leu Ala Gly Val 
180 * 185 190 

ggt gtg gcg ttt tat ctg atg ctg gcg ctg cgc ace ttt ttg cgc gat 624 
Gly Val Ala Phe Tyr Leu Met Leu Ala Leu Arg Thr Phe Leu Arg Asp 
195 200 205 

cag ggc tgg ttt gat gag cgt aac ate gca att cct aac ctg gca gaa 672 
Gin Gly Trp Phe Asp Glu Arg Asn lie Ala lie Pro Asn Leu Ala Glu 
210 ~ 215 220 

ctg ctg gat ctg gtc gcg ctg ggg aca gtg gcg gac gtc gtg ccg ctg 720 
Leu Leu Asp Leu Val Ala Leu Gly Thr Val Ala Asp Val Val Pro Leu 
225 230 235 240 

gac get aat aat cgc att ctg acc tgg cag ggg atg agt cgc ate cga 768 
Asp Ala Asn Asn Arg He Leu Thr Trp Gin Gly Met Ser Arg He Arg 
245 250 255 

gee gga aag tgc cgt ccg ggg att aaa gcg ctg ctt gaa gtg gca aac 816 
Ala Gly Lys Cys Arg Pro Gly He Lys Ala Leu Leu Glu Val Ala Asn 
260 " ' 265 270 

cgt gat gca caa aaa etc gee gee age gat tta ggt ttt gcg ctg ggg 864 
Arg Asp Ala Gin Lys Leu Ala Ala Ser Asp Leu Gly Phe Ala Leu Gly 
275 ^ 280 285 

cca cgt etc aat get gee gga cga ctg gac gat atg tec gtc ggt gtg 912 
Pro Arg Leu Asn Ala Ala Gly Arg Leu Asp Asp Met Ser Val Gly Val 
290 295 300 

gcg ctg ttg ttg tgc gac aac ate ggc gaa gcg cgc gtg ctg gca aat 960 
Ala Leu Leu Leu Cys Asp Asn He Gly Glu Ala Arg Val Leu Ala Asn 
305 310 315 320 

gaa etc gat gcg eta aac cag acg cga aaa gag ate gaa caa gga atg 1008 
Glu Leu Asp Ala Leu Asn Gin Thr Arg Lys Glu He Glu Gin Gly Met 
325 330 335 

caa att gaa gee ctg acc ctg tgc gag aaa ctg gag cgc age cgt gac 1056 
Gin He Glu Ala Leu Thr Leu Cys Glu Lys Leu Glu Arg Ser Arg Asp 
340 345 350 

acg eta ccc ggc ggg ctg gca atg tat cac ccc gaa tgg cat cag ggc 1104 
Thr Leu Pro Gly Gly Leu Ala Met Tyr His Pro Glu Trp His Gin Gly 
355 360 365 

gtt gtc ggt att ctg get teg cgc ate aaa gag cgt ttt cac cgt ccg 1152 
Val Val Gly He Leu Ala Ser Arg lie Lys Glu Arg Phe His Arg Pro 
370 375 380 

gtt ate gcg ttt gcg cca gca ggt gac ggt acg ctg aaa ggt tec ggt 1200 
Val lie Ala Phe Ala Pro Ala Gly Asp Gly Thr Leu Lys Gly Ser Gly 
385 390 395 400 

cgc tec att cag ggg ctg cat atg cgt gat gcg ctg gag cga tta gac 1248 
Arg Ser He Gin Gly Leu His Met Arg Asp Ala Leu Glu Arg Leu Asp 
405 410 415 
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aca etc tac cct ggc atg atg ctg aag ttt ggc ggt cat gcg atg gcg 1296 
Thr Leu Tyr Pro Gly Met Met Leu Lys Phe Gly Gly His Ala Met Ala 
420 425 430 

gcg ggt ttg teg ctg gaa gag gat aaa ttc aaa etc ttt caa caa egg 134 4 

Ala Gly Leu Ser Leu Glu Glu Asp Lys Phe Lys Leu Phe Gin Gin Arg 
435 440 445 

ttt ggc gaa ctg gtt act gag tgg ctg gac cct teg eta ttg caa ggc 1392 
Phe Gly Glu Leu Val Thr Glu Trp Leu Asp Pro Ser Leu Leu Gin Gly 
450 455 460 

gaa gtg gta tea gac ggt ccg tta age ccg gec gaa atg acc atg gaa 1440 
Glu Val Val Ser Asp Gly Pro Leu Ser Pro Ala Glu Met Thr Met Glu 
465 470 475 480 

gtg gcg cag ctg ctg cgc gat get ggc ccg tgg ggg cag atg ttc ccg 1488 
Val Ala Gin Leu Leu Arg Asp Ala Gly Pro Trp Gly Gin Met Phe Pro 
485 490 495 

gag ccg ctg ttt gac ggt cat ttc cgt ctg ctg caa cag egg ctg gtg 1536 
Glu Pro Leu Phe Asp Gly His Phe Arg Leu Leu Gin Gin Arg Leu Val 
500 505 510 

ggc gaa cgt cat ttg aag gtg atg gtc gaa ccg gtc ggc ggc ggt cca 1584 
Gly Glu Arg His Leu Lys Val Met Val Glu Pro Val Gly Gly Gly Pro 
515 " 520 525 

ctg ctg gat ggt att get ttt aat gtc gat acc gec etc tgg ccg gat 1632 
Leu Leu Asp Gly lie Ala Phe Asn Val Asp Thr Ala Leu Trp Pro Asp 
530 " ^ 535 540 

aac ggc gtg cgc gaa gtg caa ctg get tat aag etc gat ate aac gag 1680 
Asn Gly Val Arg Glu Val Gin Leu Ala Tyr Lys Leu Asp lie Asn Glu 
545 550 555 560 

ttt cgc ggc aac cgc age ctg caa att ate ate gac aat ate tgg cca 1728 
Phe Arg Gly Asn Arg Ser Leu Gin He He He Asp Asn He Trp Pro 
565 570 575 

att tag 1734 
He * 



<210> 180 
<211> 711 
<212> DNA 

<213> Escherichia coli 

<220> 

<221> CDS 

<222> (1) . . . (711) 

<400> 180 

atg aag aaa ggt ttt atg ttg ttt act ttg tta gcg gcg ttt tea ggc 48 

Met Lys Lys Gly Phe Met Leu Phe Thr Leu Leu Ala Ala Phe Ser Gly 

1 5 10 15 

ttt get cag get gat gac gcg gca att caa caa acg tta gec aaa atg 96 
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Phe Ala Gin Ala Asp Asp Ala Ala lie Gin Gin Thr Leu Ala Lys Met 
20 25 30 

ggc ate aaa age age gat att cag ccc gcg cct gta get ggc atg aag 144 
Gly lie Lys Ser Ser Asp lie Gin Pro Ala Pro Val Ala Gly Met Lys 
35 40 45 

aca gtt ctg act aac age ggc gtg ttg tac ate ace gat gat ggt aaa 192 
Thr Val Leu Thr Asn Ser Gly Val Leu Tyr He Thr Asp Asp Gly Lys 
50 55 60 

cat ate att cag ggg cca atg tat gac gtt agt ggc acg get ccg gtc 240 
His He He Gin Gly Pro Met Tyr Asp Val Ser Gly Thr Ala Pro Val 
65 70 75 80 

aat gtc ace aat aag atg ctg tta aag cag ttg aat gcg ctt gaa aaa 
Asn Val Thr Asn Lys Met Leu Leu Lys Gin Leu Asn Ala Leu Glu Lys 
85 90 95 

gag atg ate gtt tat aaa gcg ccg cag gaa aaa cac gtc ate acc gtg 
Glu Met He Val Tyr Lys Ala Pro Gin Glu Lys His Val He Thr Val 
100 105 110 

ttt act gat att acc tgt ggt tac tgc cac aaa ctg cat gag caa atg 
Phe Thr Asp He Thr Cys Gly Tyr Cys His Lys Leu His Glu Gin Met 
115 120 125 

gca gac tac aac gcg ctg ggg ate acc gtg cgt tat ctt get ttc ccg 
Ala Asp Tyr Asn Ala Leu Gly He Thr Val Arg Tyr Leu Ala Phe Pro 
130 135 140 

cgc cag ggg ctg gac age gat gca gag aaa gaa atg aaa get ate tgg 
Arg Gin Gly Leu Asp Ser Asp Ala Glu Lys Glu Met Lys Ala He Trp 
145 ~ 150 155 160 

tgt gcg aaa gat aaa aac aaa gcg ttt gat gat gtg atg gca ggt aaa 
Cys Ala Lys Asp Lys Asn Lys Ala Phe Asp Asp Val Met Ala Gly Lys 
165 170 175 

age gtc gca cca gee agt tgc gac gtg gat att gee gac cat tac gca 576 
Ser Val Ala Pro Ala Ser Cys Asp Val Asp He Ala Asp His Tyr Ala 
180 185 190 

ctt ggc gtc cag ctt ggc gtt age ggt act ccg gca gtt gtg ctg age 624 
Leu Gly Val Gin Leu Gly Val Ser Gly Thr Pro Ala Val Val Leu Ser 
195 200 205 

aat ggc aca ctt gtt ccg ggt tac cag ccg ccg aaa gag atg aaa gaa 
Asn Gly Thr Leu Val Pro Gly Tyr Gin Pro Pro Lys Glu Met Lys Glu 
210 215 220 

ttc etc gac gaa cac caa aaa atg acc a- c ggt aaa taa 711 
Phe Leu Asp Glu His Gin Lys Met Thr S^r Gly Lys * 
225 " 230 235 



288 



336 



384 



432 



480 



528 



672 



<210> 191 
<211> 897 
<212> DNA 

<213> Escherichia coli 
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<220> 

<221> CDS 

<222> (1) . . . (897) 

<400> 181 

gtg aaa cag gat ctg gca cgc ate gag cag ttt ctt gat get ctg tgg 48 

Met Lys Gin Asp Leu Ala Arg lie Glu Gin Phe Leu Asp Ala Leu Trp 

1 - 5 10 15 

ctg gaa aaa aat ctg get gaa aat acg ttg aac get tac cgt cgc gat 96 

Leu Glu Lys Asn Leu Ala Glu Asn Thr Leu Asn Ala Tyr Arg Arg Asp 

20 25 30 

ctg tea atg atg gtg gag tgg ttg cat cac cgc ggg ttg acg ctg gcg 14 4 

Leu Ser Met Met Val Glu Trp Leu His His Arg Gly Leu Thr Leu Ala 

35 40 45 

acg gcg caa agt gac gat ttg cag gca tta ctg gca gaa egg ctg gag 192 

Thr Ala Gin Ser Asp Asp Leu Gin Ala Leu Leu Ala Glu Arg Leu Glu 

50 55 60 

ggc ggg tat aaa gee ace age tea gcg cgt ttg ctg agt gca gtg cgc 240 

Gly Gly Tyr Lys Ala Thr Ser Ser Ala Arg Leu Leu Ser Ala Val Arg 

65 " 70 75 80 

cga ttg ttc cag tat ctt tat cgc gaa aag ttt cgt gaa gac gat ccc 288 

Arg Leu Phe Gin Tyr Leu Tyr Arg Glu Lys Phe Arg Glu Asp Asp Pro 

85 90 95 

agt gcg cat etc get tea ccg aaa ttg ccc cag cgt ttg cca aaa gat 336 

Ser Ala His Leu Ala Ser Pro Lys Leu Pro Gin Arg Leu Pro Lys Asp 

100 105 110 

tta age gaa gcg cag gtc gaa cgt tta tta cag gca cca tta ate gat 384 

Leu Ser Glu Ala Gin Val Glu Arg Leu Leu Gin Ala Pro Leu lie Asp 

115 120 125 

cag cca ctg gag eta cgc gat aaa gee atg ctt gaa gtg ttg tat get 4 32 

Gin Pro Leu Glu Leu Arg Asp Lys Ala Met Leu Glu Val Leu Tyr Ala 

130 135 140 

ace ggg ctg cgt gtc tct gaa ctg gtc gga ctg aca atg agt gat ate 480 

Thr Gly Leu Arg Val Ser Glu Leu Val Gly Leu Thr Met Ser Asp lie 

145 150 155 160 

age ctg cgt cag ggc gtg gta egg gtc att ggt aaa ggc aac aaa gag 528 

Ser Leu Arg Gin Gly Val Val Arg Val lie Gly Lys Gly Asn Lys Glu 

165 170 175 

cgt ctg gtg ccg tta ggt gaa gag gcg gtt tac tgg ctg gaa ace tat 576 

Arg Leu Val Pro Leu Gly Glu Glu Ala Val Tyr Trp Leu Glu Thr Tyr 

180 185 190 

ctg gaa cat ggg cgt ccg tgg ctg ctg 'aat ggt gtg tea att gac gtg 624 

Leu Glu His Gly Arg Pro Trp Leu Leu Asn Gly Val Ser lie Asp Val 

195 ^ 200 205 

ttg ttt ccc age cag cgt gcg cag cag atg acg cga cag ace ttc tgg 672 

Leu Phe Pro Ser Gin Arg Ala Gin Gin Met Thr Arg Gin Thr Phe Trp 
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210 215 220 

cac cgt att aaa cat tat get gtg ctg gcg ggt ate gac age gaa aag 720 
His Arg lie Lys His Tyr Ala Val Leu Ala Gly lie Asp Ser Glu Lys 
225 230 235 240 

ctg tea ccg cat gtg ttg cgt cac get ttt gee act cat tta tta aat 7 68 

Leu Ser Pro His Val Leu Arg His Ala Phe Ala Thr His Leu Leu Asn 
245 250 255 

cat ggt gcg gat tta cgc gtg gtg cag atg eta ctg ggc cac age gat 816 
His Gly Ala Asp Leu Arg Val Val Gin Met Leu Leu Gly His Ser Asp 
260 265 270 

etc tec acc acg caa att tat acg cat gtc get ace gag cgt ctg egg 864 
Leu Ser Thr Thr Gin He Tyr Thr His Val Ala Thr Glu Arg Leu Arg 
275 280 285 

caa ctt cat caa cag cat cac ccg egg gcg tga 897 
Gin Leu His Gin Gin His His Pro Arg Ala * 
290 295 



<210> 182 
<211> 741 
<212> DNA 

<213> Escherichia coli 

<220> 

<221> CDS 

<222> (1) . . . (741) 

<400> 182 

atg ttt ttt aac acc aaa cat aca aca get tta tgc ttc gta acc tgt 48 
Met Phe Phe Asn Thr Lys His Thr Thr Ala Leu Cys Phe Val Thr Cys 
15 10 15 

atg get ttt agt tea teg tct att gcg gac att gtc att teg ggt act 96 
Met Ala Phe Ser Ser Ser Ser He Ala Asp He Val He Ser Gly Thr 
20 25 30 

cgc gta ata tat aaa age gat caa aaa agt gtc aac gta cgt ctg gaa 144 
Arg Val He Tyr Lys Ser Asp Gin Lys Ser Val Asn Val Arg Leu Glu 
35 40 45 

aat aaa ggg aat aac ccg ttg ctt gtc cag agt tgg tta gat act ggc 192 
Asn Lys Gly Asn Asn Pro Leu Leu Val Gin Ser Trp Leu Asp Thr Gly 
50 55 60 

gat gac aac get gag cct ggc agt att aca gtc cct ttt act get acg 240 
Asp Asp Asn Ala Glu Pro Gly Ser He Thr Val Pro Phe Thr Ala Thr 
65 70 75 80 

ccg cca gta teg cgt att gat gee aaa cgt ggg caa aca ate aaa tta 288 
Pro Pro Val Ser Arg He Asp Ala Lys Arg Gly Gin Thr He Lys Leu 
85 90 95 

atg tac aca gec age acc tea ctg cct aaa gac aga gag age gtg ttc 336 
Met Tyr Thr Ala Ser Thr Ser Leu Pro Lys Asp Arg Glu Ser Val Phe 
100 105 110 
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tgg ttt aac gta ctg gaa gtt cca cca aaa cca gat gca gaa aag gtc 384 
Trp Phe Asn Val Leu Glu Val Pro Pro Lys Pro Asp Ala Glu Lys Val 
115 120 125 

gcg aat caa age ctg ctg caa ctg gca ttt cgc aca cgt ata aaa ctt 4 32 

Ala Asn Gin Ser Leu Leu Gin Leu Ala Phe Arg Thr Arg lie Lys Leu 
130 135 140 

ttc tat cgc ccg gat gga ttg aag gga aat ccc tct gaa gcc ccg tta 4 80 

Phe Tyr Arg Pro Asp Gly Leu Lys Gly Asn Pro Ser Glu Ala Pro Leu 
145 150 155 160 

gcc ctg aag tgg ttc tgg tea ggt tea gaa ggt aag gcg tea tta cga 528 
Ala Leu Lys Trp Phe Trp Ser Gly Ser Glu Gly Lys Ala Ser Leu Arg 
165 170 175 

gtg ace aat cca ace cct tac tac gtc tct ttt age agt ggt gat tta 576 
Val Thr Asn Pro Thr Pro Tyr Tyr Val Ser Phe Ser Ser Gly Asp Leu 
180 185 190 

gaa get age ggt aaa cgc tat ccg att gat gtg aaa atg att gca cca 624 
Glu Ala Ser Gly Lys Arg Tyr Pro He Asp Val Lys Met He Ala Pro 
195 200 205 

ttt agt gat gag gtc atg aaa gtc aat ggc ctt aat ggc aaa gcg aat 672 
Phe Ser Asp Glu Val Met Lys Val Asn Gly Leu Asn Gly Lys Ala Asn 
210 215 220 

tct gca aaa gtg cat ttt tac gcc att aat gac ttt. ggt ggc gca att 720 
Ser Ala Lys Val His Phe Tyr Ala He Asn Asp Phe Gly Gly Ala He 
225 230 235 240 

gaa ggt aat gcc agg ctg taa 741 
Glu Gly Asn Ala Arg Leu * 
245 



<210> 183 
<211> 2598 
<212> DNA 

<213> Escherichia coli 

<220> 

<221> CDS 

<222> (1) . . . (2598) 

<400> 183 

gtg act ata gaa tat act aaa aat tat cat cat ctg acc cgt ate gcc 48 
Met Thr He Glu Tyr Thr Lys Asn Tyr His His Leu Thr Arg He Ala 
15 10 15 

acg ttt tgc gcg ctg ctg tat tgc aat act get ttc agt get gaa etc 96 
Thr Phe Cys Ala Leu Leu Tyr Cys Asn Thr Ala Phe Ser Ala Glu Leu 
20 25 30 

gtt gaa tat gac cat acc ttc ctg atg ggg cag aat gca tct aat att 14 4 

Val Glu Tyr Asp His Thr Phe Leu Met Gly Gin Asn Ala Ser Asn He 
35 40 45 
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gat etc age egg tac agt gaa ggt aac ccc get at a ccg ggt gtt tac 192 

Asp Leu Ser Arg Tyr Ser Glu Gly Asn Pro Ala lie Pro Gly Val Tyr 

50 55 60 

gac gtc agt gtt tat gta aac gac caa cca ate att aac caa agt att 240 

Asp Val Ser Val Tyr Val Asn Asp Gin Pro lie lie Asn Gin Ser lie 

65 70 75 80 

aca ttt gtc gca att gaa gga aaa aag aac gee cag get tgt ate aca 288 

Thr Phe Val Ala lie Glu Gly Lys Lys Asn Ala Gin Ala Cys lie Thr 

85 90 95 

tta aag aat tta ttg cag ttt cat att aat tct ccc gat ata aat aac 336 

Leu Lys Asn Leu Leu Gin Phe His lie Asn Ser Pro Asp lie Asn Asn 

100 105 110 

gaa aaa gec gtt ctg ctt gec agg gat gaa acg etc ggc aat tgc etc 384 

Glu Lys Ala Val Leu Leu Ala Arg Asp Glu Thr Leu Gly Asn Cys Leu 

115 120 ~ 125 

aat ttg acg gaa att ate cct cag get tct gtt cgt tat gac gtt aac 4 32 

Asn Leu Thr Glu lie lie Pro Gin Ala Ser Val Arg Tyr Asp Val Asn 

130 135 140 

gat caa cgt ctg gat ata gac gtt cct caa gec tgg gta atg aaa aat 480 

Asp Gin Arg Leu Asp lie Asp Val Pro Gin Ala Trp Val Met Lys Asn 

145 150 155 160 

tac caa aac tat gtt gat cca teg tta tgg gaa aac ggc att aat gcg 528 

Tyr Gin Asn Tyr Val Asp Pro Ser Leu Trp Glu Asn Gly lie Asn Ala 

165 170 175 

gec atg ttg tea tac aac etc aac gga tat cat agt gaa acc cct ggt 576 

Ala Met Leu Ser Tyr Asn Leu Asn Gly Tyr His Ser Glu Thr Pro Gly 

180 185 190 

cga aaa aat gaa age att tat get gca ttt aac ggt ggg atg aat tta 624 

Arg Lys Asn Glu Ser lie Tyr Ala Ala Phe Asn Gly Gly Met Asn Leu 

195 200 205 

ggt gca tgg cga ctg cgt gee teg ggc aac tac aac tgg atg acc gat 672 

Gly Ala Trp Arg Leu Arg Ala Ser Gly Asn Tyr Asn Trp Met Thr Asp 

210 215 220 

tct ggc agt aat tat gat ttt aag aat egg tat gtt cag cgt gat ate 720 

Ser Gly Ser Asn Tyr Asp Phe Lys Asn Arg Tyr Val Gin Arg Asp lie 

225 230 235 240 

gee teg ctg cgt tct caa etc att ctt ggt gag tct tat acg acg ggc 768 

Ala Ser Leu Arg Ser Gin Leu lie Leu Gly Glu Ser Tyr Thr Thr Gly 

245 250 255 

gaa acc ttt gat tec gtc agt ate cga ggc att cgt tta tac agt gac 816 

Glu Thr Phe Asp Ser Val Ser lie Arg Gly lie Arg Leu Tyr Ser Asp 

260 265 270 

age cgc atg ttg cct ccg act tta gee age ttt gcg cct ate att cat 864 

Ser Arg Met Leu Pro Pro Thr Leu Ala Ser Phe Ala Pro lie lie His 

275 280 285 
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ggc gtt gcc aat acc aac gcc aaa gtc act att acg caa ggt v ggc tat 912 
Gly Val Ala Asn Thr Asn Ala Lys Val Thr He Thr Gin Gly Gly Tyr 
290 295 300 

aag att tat gaa acg acg gtg ccg cca ggc get ttc gtc att gat gat 960 
Lys He Tyr Glu Thr Thr Val Pro Pro Gly Ala Phe Val He Asp Asp 
305 " 310 315 320 

ctg agt ccg tea ggg tac ggc age gat ctt att gtt acc ate gaa gaa 1008 
Leu Ser Pro Ser Gly Tyr Gly Ser Asp Leu He Val Thr He Glu Glu 
325 * * 330 335 

tec gat ggc tea aag egg aca ttc teg caa cct ttc tea tec gtt gtt 1056 
Ser Asp Gly Ser Lys Arg Thr Phe Ser Gin Pro Phe Ser Ser Val Val 
340 345 350 

caa atg tta cgc cct ggc gtt gga cgt tgg gat att age ggc ggt cag 1104 
Gin Met Leu Arg Pro Gly Val Gly Arg Trp Asp He Ser Gly Gly Gin 
355 360 365 

gtc tta aaa gat gat att cag gat gag cct aat tta ttt caa gca age 1152 
Val Leu Lys Asp Asp He Gin Asp Glu Pro Asn Leu Phe Gin Ala Ser 
370 ^ *~ 375 380 

tac tac tac ggc ctg aat aac tat ctg acg ggt tat acc ggt att cag 1200 
Tyr Tyr Tyr Gly Leu Asn Asn Tyr Leu Thr Gly Tyr Thr Gly He Gin 
385 ^ d 390 395 400 

ata acc gat aat aac tat acc get ggt ttg tta ggt ctt ggt ctg aat 1248 
He Thr Asp Asn Asn Tyr Thr Ala Gly Leu Leu Gly Leu Gly Leu Asn 
405 410 415 

act tea gtt ggt gca ttt tct ttc gat gtg act cat tec aat gtt cgt 12 96 

Thr Ser Val Gly Ala Phe Ser Phe Asp Val Thr His Ser Asn Val Arg 
420 425 430 

ate ccg gat gat aaa aca tac cag ggg caa agt tat cgt gtt tec tgg 1344 
He Pro Asp Asp Lys Thr Tyr Gin Gly Gin Ser Tyr Arg Val Ser Trp 
435 440 445 

aac aag tta ttc gaa gaa aca agt -act tea ctg aat ate gcg gcc tat 1392 
Asn Lys Leu Phe Glu Glu Thr Ser Thr Ser Leu Asn He Ala Ala Tyr 
450 455 460 

cgc tat teg aca cag aat tac ctt ggt ctt aat gat gca eta act eta 1440 
Arg Tyr Ser Thr Gin Asn Tyr Leu Gly Leu Asn Asp Ala Leu Thr Leu 
465 470 475 480 

att gat gaa gtg aaa cat ccc gaa caa gat ctt gaa ccg aaa tec atg 1488 
He Asp Glu Val Lys His Pro Glu Gin Asp Leu Glu Pro Lys Ser Met 
485 490 495 

cgt aat tac tea cgc atg aaa aat cag gtt acg gtc agt att aac caa 1536 
Arg Asn Tyr Ser Arg Met Lys Asn Gin Val Thr Val Ser He Asn Gin 
500 ' 505 510 

ccg ttg aaa ttt gag aaa aaa gat tac ggt tea ttt tat ctt tec gga 1584 
Pro Leu Lys Phe Glu Lys Lys Asp Tyr Gly Ser Phe Tyr Leu Ser Gly 
515 520 525 
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agt tgg tec gat tac tgg get tec gga caa aat cgt age aat tac tct 1632 
Ser Trp Ser Asp Tyr Trp Ala Ser Gly Gin Asn Arg Ser Asn Tyr Ser 
530 535 540 

att ggc tac agt aac agt aca tec tgg ggc age tac agt gtc agt gec 1680 
He Gly Tyr Ser Asn Ser Thr Ser Trp Gly Ser Tyr Ser Val Ser Ala 
545 550 555 560 

cag cgt tea tgg aat gaa gac ggc gac act gac gat age gtt tat ctt 1728 
Gin Arg Ser Trp Asn Glu Asp Gly Asp Thr Asp Asp Ser Val Tyr Leu 
565 570 575 

agt ttc acc att cca att gaa aaa tta ctt ggc act gaa caa cgt act 1776 
Ser Phe Thr He Pro He Glu Lys Leu Leu Gly Thr Glu Gin Arg Thr 
580 585 590 

tea ggt ttc cag agt att gat act caa ata age agt gac ttt aag ggt 1824 
Ser Gly Phe Gin Ser He Asp Thr Gin He Ser Ser Asp Phe Lys Gly 
595 600 605 

aat aac caa etc aac gtt age age agt ggc tat age gat aac get cgc 1872 
Asn Asn Gin Leu Asn Val Ser Ser Ser Gly Tyr Ser Asp Asn Ala Arg 
610 615 620 



gtc agt tat age gtg aat act ggc tat acg atg aat aaa gee age aaa 
Val Ser Tyr Ser Val Asn Thr Gly Tyr Thr Met Asn Lys Ala Ser Lys 
625 630 635 640 



1920 



gat ttg agt tat gtt ggg ggt tat gee age tat gag tea cca tgg gga 1968 

Asp Leu Ser Tyr Val Gly Gly Tyr Ala Ser Tyr Glu Ser Pro Trp Gly 
645 650 655 

acg ctg gca ggt tea att tct gca aat age gat aac age cgt caa gtt 2016 

Thr Leu Ala Gly Ser He Ser Ala Asn Ser Asp Asn Ser Arg Gin Val 
660 665 670 

tct etc age acc gac ggt ggt ttt gta ttg cat age ggt gga ctg act 2064 

Ser Leu Ser Thr Asp Gly Gly Phe Val Leu His Ser Gly Gly Leu Thr 
675 680 685 

ttc agt aat gat agt ttt age gac tec gat aca ctg gcg gta gtt cag 2112 

Phe Ser Asn Asp Ser Phe Ser Asp Ser Asp Thr Leu Ala Val Val Gin 

690 695 700 

get cca ggt get caa gga gcg cga ata aat tat ggc aac agt act ate 2160 

Ala Pro Gly Ala Gin Gly Ala Arg He Asn Tyr Gly Asn Ser Thr He 
705 710 715 720 

gat cga tgg ggt tat ggt gtc acc age get ctt tct cct tat cat gaa 2208 

Asp Arg Trp Gly Tyr Gly Val Thr Ser Ala Leu Ser Pro Tyr His Glu 
725 730 735 

aac cgt ate gcg ctg gat ate aac gat ctt gag aac gat gtt gaa tta 2256 

Asn Arg He Ala Leu Asp He Asn Asp Leu Glu Asn Asp Val Glu Leu 
740 745 750 

aaa agt acc agt gca gta get gta ccg cgt cag ggt tea gtc gtc ttt 2304 

Lys Ser Thr Ser Ala Val Ala Val Pro Arg Gin Gly Ser Val Val Phe 
755 760 765 
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get gat ttt gaa acc gtg caa ggg caa tea gee att atg aac ate aca 2352 

Ala Asp Phe Glu Thr Val Gin Gly Gin Ser Ala lie Met Asn He Thr 

770 775 780 

cga agt gat ggt aaa aat att cca ttt get gca gat att tat gat gag 24 00 

Arg Ser Asp Gly Lys Asn He Pro Phe Ala Ala Asp He Tyr Asp Glu 
785 " 790 795 800 

caa ggc aat gtc att ggt aat gtt gga cag ggt gga caa gca ttt gtt 24 48 

Gin Gly Asn Val He Gly Asn Val Gly Gin Gly Gly Gin Ala Phe Val 

805 810 815 

cgt ggt att gag cag cag gga aat ate age att aaa tgg etc gaa caa 24 96 

Arg Gly He Glu Gin Gin Gly Asn He Ser He Lys Trp Leu Glu Gin 
820 825 830 

agt aaa ccc gta agt tgt ctt gcg cat tat caa caa age cca gaa gca 2544 

Ser Lys Pro Val Ser Cys Leu Ala His Tyr Gin Gin Ser Pro Glu Ala 
835 840 845 

gaa aaa ata gca caa tct att att ctg aat gga att egg tgt cag att 2592 

Glu Lys He Ala Gin Ser He He Leu Asn Gly He Arg Cys Gin He 

850 855 860 

cag taa 2598 

Gin * 

865 



<210> 184 
<211> 1986 
<212> DNA 

<213> Escherichia coli 

<220> 

<221> CDS 

<222> (1) . . . (1986) 

<400> 184 

atg aaa acc gtt agg gag tec aca acg ttg tac aac ttt etc gga teg 48 

Met Lys Thr Val Arg Glu Ser Thr Thr Leu Tyr Asn Phe Leu Gly Ser 
1 5 10 15 

cac aat cca tac tgg egg ttg acg gaa age age gat gtt ttg cgc ttt 96 
His Asn Pro Tyr Trp Arg Leu Thr Glu Ser Ser Asp Val Leu Arg Phe 
20 25 30 

tct acc acc gaa acc aca gaa cct gat cgt aca ttg cag tta tct gec 144 
Ser Thr Thr Glu Thr Thr Glu Pro Asp Arg Thr Leu Gin Leu Ser Ala 
35 40 45 

gaa cag get get cgc ate agg gaa atg acg gtc ate acc tec age ctg 192 
Glu Gin Ala Ala Arg He Arg Glu Met Thr Val He Thr Ser Ser Leu 
50 55 60 

atg atg agt ctg acc gtc gat gaa age gat ctt tct gtg cat ctg gta 240 
Met Met Ser Leu Thr Val Asp Glu Ser Asp Leu Ser Val His Leu Val 
65 70 75 80 

gga cga aaa ate aat aaa egg gaa tgg get ggc aac gcg tct gca tgg 288 
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Gly Arg Lys lie Asn Lys Arg Glu Trp Ala Gly Asn Ala Ser Ala Trp 
85 90 95 

cat gac aca ccg gcg gtt get cgt gat tta tea cac ggg ctt tec ttt 336 
His Asp Thr Pro Ala Val Ala Arg Asp Leu Ser His Gly Leu Ser Phe 
100 105 110 

get gag cag gta gtt tct gaa gca cat tec gca ata gtg att etc gac 384 
Ala Glu Gin Val Val Ser Glu Ala His Ser Ala He Val He Leu Asp 
115 120 125 

age egg ggg aat ate caa cgc ttc aat egg tta tgt gaa gat tac aca 432 
Ser Arg Gly Asn He Gin Arg Phe Asn Arg Leu Cys Glu Asp Tyr Thr 
130 135 140 

ggg ttg aaa gaa cac gac gtc att ggg caa age gtg ttt aaa ctg ttt 480 
Gly Leu Lys Glu His Asp Val He Gly Gin Ser Val Phe Lys Leu Phe 
145 150 155 160 

atg age cgt cgt gaa get gcg gca tec agg cgc aat aac cgt gta ttt 528 
Met Ser Arg Arg Glu Ala Ala Ala Ser Arg Arg Asn Asn Arg Val Phe 
165 170 . 175 

ttt cga age ggc aat gca tat gaa gtc gaa ctg tgg ata cca aca tgt 576 
Phe Arg Ser Gly Asn Ala Tyr Glu Val Glu Leu Trp He Pro Thr Cys 
180 185 190 



aaa ggc cag egg ctg ttt ctg ttt cgc aat aaa ttt gtc cac age ggc 
Lys Gly Gin Arg Leu Phe Leu Phe Arg Asn Lys Phe Val His Ser Gly 
195 200 205 

agt ggc aaa aac gag att ttt tta ate tgt tec ggc acc gac att acc 
Ser Gly Lys Asn Glu He Phe Leu He Cys Ser Gly Thr Asp He Thr 
210 215 220 

gaa gag cgc cgc get cag gag cga ctg cgt att ctg gca aat acc gac 
Glu Glu Arg Arg Ala Gin Glu Arg Leu Arg He Leu Ala Asn Thr Asp 
225 230 235 240 

agt ate acc gga ctg ccg aat cgt aac gca atg cag gat tta ate gat 
Ser He Thr Gly Leu Pro Asn Arg Asn Ala Met Gin Asp Leu He Asp 
245 250 255 

cac get att aat cat gca gat aac aat aaa gtt ggg gtt gtg tat ctt 
His Ala He Asn His Ala Asp Asn Asn Lys Val Gly Val Val Tyr Leu 
260 265 . 270 

gat ttg gat aat ttc aaa aag gtc aac gac gec tat ggg cat ttg ttt 
Asp Leu Asp Asn Phe Lys Lys Val Asn Asp Ala Tyr Gly His Leu Phe 
275 280 285 

ggt gac cag tta tta cgc gac gtg tea ttg get att tta age tgt etc 
Gly Asp Gin Leu Leu Arg Asp Val Ser Leu Ala He Leu Ser Cys Leu 
290 295 300 

gaa cat gac cag gtg ttg gcg cgt cca ggt ggg gat gag ttt ctg gta 
Glu His Asp Gin Val Leu Ala Arg Pro Gly Gly Asp Glu Phe Leu Val 
305 310 315 320 

ctg gca tec aac acc tea caa age gcg ctg gaa gca atg gca tea cga 



624 



672 



720 



768 



816 



864 



912 



960 



1008 
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Leu Ala Ser Asn Thr Ser Gin Ser Ala Leu Glu Ala Met Ala Ser Arg 
325 330 335 

att ttg acc cgc tta egg etc ccc ttt cgc att ggt tta att gaa gtt 1056 
lie Leu Thr Arg Leu Arg Leu Pro Phe Arg lie Gly Leu lie Glu Val 
340 345 350 

tat acc age tgt tea gta ggt att gca etc tct ccc gaa cat ggt tea 1104 
Tyr Thr Ser Cys Ser Val Gly lie Ala Leu Ser Pro Glu His Gly Ser 
355 360 365 

gac age acg get att att cgt cac gec gac aca gca atg tac aca gcg 1152 
Asp Ser Thr Ala lie He Arg His Ala Asp Thr Ala Met Tyr Thr Ala 
370 375 380 

aag gaa ggc gga cga gga caa ttt tgc gtt ttt acc cca gaa atg aat 1200 
Lys Glu Gly Gly Arg Gly Gin Phe Cys Val Phe Thr Pro Glu Met Asn 
385 " 390 395 400 

caa egg gta ttt gaa tat etc tgg ctg gat acc aac ttg cgt aaa gca 1248 
Gin Arg Val Phe Glu Tyr Leu Trp Leu Asp Thr Asn Leu Arg Lys Ala 
405 410 415 

ctg gaa aac gat cag ttg gtt att cac tat caa ccg aaa ate acc tgg 1296 
Leu Glu Asn Asp Gin Leu Val He His Tyr Gin Pro Lys He Thr Trp 
420 425 430 

cgt ggc gaa gtg cgc agt ctg gaa gca eta gta cgt tgg cag tea cct 1344 
Arg Gly Glu Val Arg Ser Leu Glu Ala Leu Val Arg Trp Gin Ser Pro . 
435 440 445 

gaa cgt ggg ttg att cca ccg ttg gac ttc att tec tac gec gaa gag 1392 
Glu Arg Gly Leu He Pro Pro Leu Asp Phe He Ser Tyr Ala Glu Glu 
450 455 460 

tea ggg eta att gtg cct tta ggc cgt tgg gtg att etc gat gtc gta 1440 
Ser Gly Leu He Val Pro Leu Gly Arg Trp Val He Leu Asp Val Val 
465 470 475 480 

cgc cag gtg gca aag tgg egg gat aaa ggc ata aac ctg cga gtg gcg 1488 
Arg Gin Val Ala Lys Trp Arg Asp Lys Gly He Asn Leu Arg Val Ala 
485 490 495 

gta aat att tct gca cgt cag etc gee gat caa acc att ttc acc gec 1536 
Val Asn He Ser Ala Arg Gin Leu Ala Asp Gin Thr He Phe Thr Ala 
500 505 510 

ctg aaa cag gtt etc cag gaa etc aat ttt gaa tac tgc cct ata gat 1584 
Leu Lys Gin Val Leu Gin Glu Leu Asn Phe Glu Tyr Cys Pro lie Asp 
515 520 525 

gtt gaa ctg aca gag agt tgt ctg att gag aat gat gaa ctg gca ctg 1632 
Val Glu Leu Thr Glu Ser Cys Leu He Glu Asn Asp Glu Leu Ala Leu 
530 535 540 

tct gtt att caa caa ttt age caa eta ggt gcg caa gtg cat ctg gat 1680 
Ser Val He Gin Gin Phe Ser Gin Leu Gly Ala Gin Val His Leu Asp 
545 550 555 560 

gat ttt ggt acc ggc tac tct tea ctt teg caa ctg gcg cgc ttt ccg 1728 
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Asp Phe Gly Thr Gly Tyr Ser Ser Leu Ser Gin Leu Ala Arg Phe Pro 
565 570 575 

ate gat gec ate aaa ctt gac cag gtt ttt gtt cga gat att cac aaa 
lie Asp Ala lie Lys Leu Asp Gin Val Phe Val Arg Asp lie His Lys 
580 585 590 

caa cct gtc teg cag tea ctg gtc egg gcg ate gtc get gtg gee cag 
Gin Pro Val Ser Gin Ser Leu Val Arg Ala He Val Ala Val Ala Gin 
595 600 605 

gca ttg aat ctt cag gtg ate gee gaa ggt gta gag agt gca aag gaa 
Ala Leu Asn Leu Gin Val He Ala Glu Gly Val Glu Ser Ala Lys Glu 
610 615 620 

gat get ttt tta acc aag aac ggg ate aat gag egg caa gga ttt ttg 
Asp Ala Phe Leu Thr Lys Asn Gly He Asn Glu Arg Gin Gly Phe Leu 
625 630 635 640 

ttt gee aaa ccg atg ccc gec gtc gec ttc gaa cgc tgg tat aaa cgc 
Phe Ala Lys Pro Met Pro Ala Val Ala Phe Glu Arg Trp Tyr Lys Arg 
645 650 655 

•tat ctg aag cgc gca taa 
Tyr Leu Lys Arg Ala * 
660 



<210> 185 
<211> 693 
<212> DNA 

<213> Escherichia coli 

<220> 

<221> CDS 

<222> (1) ... (693) 



1776 



1824 



1872 



1920 



1968 



1986 



a^g°atg 8 act aaa ata aag tta ttg atg etc att at a ttt tat tta ate 48 
Met Met Thr Lys He Lys Leu Leu Met Leu He He Phe Tyr Leu He 
- 5 10 15 



1 



att teg gee age gec cat get gee gga ggg ate gca tta ggt gec acg 
He Ser Ala Ser Ala His Ala Ala Gly Gly He Ala Leu Gly Ala Thr 

25 30 



20 



cgt att att tat ccc get gat get aaa cag act gcg gta tgg att aga 
Arq He He Tyr Pro Ala Asp Ala Lys Gin Thr Ala Val Trp He Arg 
35 40 45 

aat age cat acc aat gag cgc ttt ctg gtc aat teg tgg att gaa aac 
Asn Ser His Thr Asn Glu Arg Phe Leu Val Asn Ser Trp He Glu Asn 
50 55 60 

aqc age ggt gta aaa gaa aag tea ttc ate att aca ccg cca ctg ttt 
Ser Ser Gly Val Lys Glu Lys Ser Phe He He Thr Pro Pro Leu Phe 
65 ~ 70 75 8° 

gtt agt gaa ccc aaa age gaa aat act ttg cgt att att tac acc ggt 
Val Ser Glu Pro Lys Ser Glu Asn Thr Leu Arg He He Tyr Thr Gly 



96 



144 



192 



240 



288 
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85 90 95 

cca ccg ctg gca gca gat cgt gag tct ctg ttc tgg atg aat gtt aag 336 
Pro Pro Leu Ala Ala Asp Arg Glu Ser Leu Phe Trp Met Asn Val Lys 
100 105 110 

acg ate cct teg gta gat aaa aat gca ttg aac ggc agg aat gtt ttg 384 
Thr lie Pro Ser Val Asp Lys Asn Ala Leu Asn Gly Arg Asn Val Leu 
115 120 125 

caa ctg gcg att tta teg cgc atg aaa tta ttt etc cgt cca att caa 4 32 

Gin Leu Ala lie Leu Ser Arg Met Lys Leu Phe Leu Arg Pro lie Gin 
130 135 140 

tta caa gaa tta ccc gca gaa gcg ccg gac aca etc aag ttt teg cga 4 80 

Leu Gin Glu Leu Pro Ala Glu Ala Pro Asp Thr Leu Lys Phe Ser Arg 
145 150 155 160 

tec ggt aac tat ate aat gtt cat aat cca tea cct ttt tat gtc ace 528 
Ser Gly Asn Tyr lie Asn Val His Asn Pro Ser Pro Phe Tyr Val Thr 
165 170 175 

ctg gtt aac tta caa gtg ggc age caa aag ttg ggg aat get atg get 576 
Leu Val Asn Leu Gin Val Gly Ser Gin Lys Leu Gly Asn Ala Met Ala 
180 185 190 

gca ccc aga gtt aat tea caa att ccc tta ccc tea gga gtg cag gga 624 
Ala Pro Arg Val Asn Ser Gin lie Pro Leu Pro Ser Gly Val Gin Gly 
195 200 205 

aag ctg aaa ttt cag ace gtt aat gat tat ggt tea gta act ccg gtc 672 
Lys Leu Lys Phe Gin Thr Val Asn Asp Tyr Gly Ser Val Thr Pro Val 
210 215 220 

aga gaa gtg aac tta aac taa 693 
Arg Glu Val Asn Leu Asn * 
225 230 



<210> 186 
<211> 2604 
<212> DNA 

<213> Escherichia coli 

<220> 

<221> CDS 

<222> (1) . . . (2604) 

<400> 186 

atg aaa ata ccc act act acg gat att ccg cag agg tat ace tgg tgt 48 
Met Lys He Pro Thr Thr Thr Asp He Pro Gin Arg Tyr Thr Trp Cys 
15 10 15 

ctg gec gga att tgt tat tea tct ctt gee att tta ccc tec ttt tta 96 
Leu Ala Gly He Cys Tyr Ser Ser Leu Ala He Leu Pro Ser Phe Leu 
20 25 30 

age tat gcg gaa agt tat ttc aac ccg gca ttt tta tta gag aat ggc 144 
Ser Tyr Ala Glu Ser Tyr Phe Asn Pro Ala Phe Leu Leu Glu Asn Gly 
35 ' 40 45 
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aca tec gtt get gat tta teg cgc ttt gag aga ggt aat cat caa cct 192 
Thr Ser Val Ala Asp Leu Ser Arg Phe Glu Arg Gly Asn His Gin Pro 
50 " 55 60 



gcg ggc gtg tat egg gtg gat etc tgg cgt aat gat gag ttc att ggt 
Ala Gly Val Tyr Arg Val Asp Leu Trp Arg Asn Asp Glu Phe lie Gly 
65 70 75 80 

teg cag gat ate gta ttt gaa teg aca aca gaa aat aca ggt gat aaa 
Ser Gin Asp He Val Phe Glu Ser Thr Thr Glu Asn Thr Gly Asp Lys 
85 90 95 



aaa tgc ate aat tta ctg aaa get gta cct gat gee aca att aac ttt 

Lys Cys lie Asn Leu Leu Lys Ala Val Pro Asp Ala Thr He Asn Phe 
130 135 140 

gat ttt gca gcg atg cgc ctg aac ate act att cct cag ata gcg ttg 

Asp Phe Ala Ala Met Arg Leu Asn He Thr He Pro Gin He Ala Leu 
145 150 155 160 

ttg agt age get cac ggt tac att ccg cct gaa gag tgg gat gaa ggt 

Leu Ser Ser Ala His Gly Tyr He Pro Pro Glu Glu Trp Asp Glu Gly 

165 170 175 



ccg tgg cgt tta cgc aac aat ggt tec tgg aac tat ttt cgc gga aat 
Pro Trp Arg Leu Arg Asn Asn Gly Ser Trp Asn Tyr Phe Arg Gly Asn 
210 215 220 



gec att att ccg ctg aaa agt gaa ctg gta atg gga gac ggc aat aca 
Ala He He Pro Leu Lys Ser Glu Leu Val Met Gly Asp Gly Asn Thr 
245 250 255 

gga agt gat att ttc gat ggc gtt gga ttt cgt ggt gta egg ctt tat 
Gly Ser Asp He Phe Asp Gly Val Gly Phe Arg Gly Val Arg Leu Tyr 
260 265 270 

tct tct gat aat atg tat cct gat age cag caa ggg ttt gee cca acg 
Ser Ser Asp Asn Met Tyr Pro Asp Ser Gin Gin Gly Phe Ala Pro Thr 
275 280 285 



240 



288 



tea ggt ggg tta atg ccc tgt ttt aac cag gta ctt ctt gaa cga att 336 
Ser Gly Gly Leu Met Pro Cys Phe Asn Gin Val Leu Leu Glu Arg He 
100 105 110 

ggc ctt aat age agt gca ttt ccc gag tta gee cag cag caa aac aat 384 
Gly Leu Asn Ser Ser Ala Phe Pro Glu Leu Ala Gin Gin Gin Asn Asn 
115 120 125 



432 



480 



528 



att cct get tta etc ctg aat tat aat ttc ace ggt aac aga ggt aat 576 

He Pro Ala Leu Leu Leu Asn Tyr Asn Phe Thr Gly Asn Arg Gly Asn 

180 185 190 

ggt aac gat age tat ttt ttt agt gag etc age ggg att aat att ggc 624 

Gly Asn Asp Ser Tyr Phe Phe Ser Glu Leu Ser Gly He Asn He Gly 
195 200 205 



672 



gga tat cat tea gaa cag tgg aat aat att ggc ace tgg gta cag cgc 720 
Gly Tyr His Ser Glu Gin Trp Asn Asn He Gly Thr Trp Val Gin Arg 
225 230 235 240 



768 



816 



864 
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gta cgt ggg att gcc cgt acg gcg gcc cag eta acg att egg caa aat 912 
Val Arg Gly He Ala Arg Thr Ala Ala Gin Leu Thr He Arg Gin Asn 
290 295 300 

ggt ttt att ate tat caa age tat gtt tec ccc ggc get ttt gaa att 960 
Gly Phe He He Tyr Gin Ser Tyr Val Ser Pro Gly Ala Phe Glu He 
305 * 310 315 ~ 320 

aca gat ttg cac ccg aca tct tea aat ggc gat ctg gac gtc ace ate 1008 
Thr Asp Leu His Pro Thr Ser Ser Asn Gly Asp Leu Asp Val Thr He 
325 330 335 

gac gag cgc gat ggc aat cag cag aat tac aca att ccg tat tea aca 1056 
Asp Glu Arg Asp Gly Asn Gin Gin Asn Tyr Thr He Pro Tyr Ser Thr 
340 345 350 

gtg cca att tta caa cgc gaa ggg cgt ttc aaa ttt gac ctg acg gcg 1104 
Val Pro He Leu Gin Arg Glu Gly Arg Phe Lys Phe Asp Leu Thr Ala 
355 360 ' 365 

ggc gat ttt cgt age ggt aat agt cag caa tea teg cct ttc ttt ttt 1152 
Gly Asp Phe Arg Ser Gly Asn Ser Gin Gin Ser Ser Pro Phe Phe Phe 
370 375 380 

cag ggt acg gca etc ggc ggt tta cca cag gaa ttt act gcc tac ggc 1200 
Gin Gly Thr Ala Leu Gly Gly Leu Pro Gin Glu Phe Thr Ala Tyr Gly 
385 390 " 395 400 

ggg acg caa tta tct gcc aat tac acc gcc ttt tta tta ggg ctg ggg 1248 
Gly Thr Gin Leu Ser Ala Asn Tyr Thr Ala Phe Leu Leu Gly Leu Gly 
405 410 415 

cgc aat etc ggg aac tgg ggc gca gtg teg ctg gat gta acg cat gcg 1296 
Arg Asn Leu Gly Asn Trp Gly Ala Val Ser Leu Asp Val Thr His Ala 
420 425 430 

cgc agt cag tta gcc gac gcc agt cgt cat gag ggg gat tct att cgc 1344 
Arg Ser Gin Leu Ala Asp Ala Ser Arg His Glu Gly Asp Ser He Arg 
435 440 445 

ttc etc tat gcg aaa teg atg aac acc ttc ggc acc aat ttt cag tta 1392 
Phe Leu Tyr Ala Lys Ser Met Asn Thr Phe Gly Thr Asn Phe Gin Leu 
450 455 460 

atg ggt tac cgc tat teg aca caa ggt ttt tat acc ctt gat gat gtt 1440 
Met Gly Tyr Arg Tyr Ser Thr Gin Gly Phe Tyr Thr Leu Asp Asp Val 
465 470 475 480 

gcg tat cgt cga atg gag ggg tac gaa tat gat tac gac ggt gag cat 1488 
Ala Tyr Arg Arg Met Glu Gly Tyr Glu Tyr Asp Tyr Asp Gly Glu His 
485 490 495 

cgc gat gaa ccg ata ate gtg aat tac cac aat tta cgc ttt age cgt 1536 
Arg Asp Glu Pro He He Val Asn Tyr His Asn Leu Arg Phe Ser Arg 
500 505 510 

aaa gac cgt ttg cag tta aat gtt tea caa tea ctt aat gac ttt ggc 1584 
Lys Asp Arg Leu Gin Leu Asn Val Ser Gin Ser Leu Asn Asp Phe Gly 
515 520 525 
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teg ctt tat att tct ggt acc cat caa aaa tac tgg aat act teg gat 1632 

Ser Leu Tyr He Ser Gly Thr His Gin Lys Tyr Trp Asn Thr Ser Asp 
530 535 540 

tea gat acg tgg tat cag gtg ggg tat acc age age tgg gtt ggc ate 1680 

Ser Asp Thr Trp Tyr Gin Val Gly Tyr Thr Ser Ser Trp Val Gly He 
545 550 555 560 



agt tat teg etc tea ttt teg tgg aat gaa tct gta ggg ate ccc gat 
Ser Tyr Ser Leu Ser Phe Ser Trp Asn Glu Ser Val Gly He Pro Asp 
565 570 575 



acc aaa cgt cgc tac acc egg gaa aat gcg etc gac cgc get tat gee 
Thr Lys Arg Arg Tyr Thr Arg Glu Asn Ala Leu Asp Arg Ala Tyr Ala 
595 600 605 



age cag ggt gat acc teg aat aat ggg tac acg ggc age gee acg gca 
Ser Gin Gly Asp Thr Ser Asn Asn Gly Tyr Thr Gly Ser Ala Thr Ala 
645 650 655 



1728 



aac gaa cgt att gtc gga ctt aat gtt tea gtg cct ttc aat gtt ttg 1776 
Asn Glu Arg He Val Gly Leu Asn Val Ser Val Pro Phe Asn Val Leu 
580 585 590 



1824 



tec ttt aac gee aac cgt aac age aac ggg caa aat age tgg ctg gca 1872 
Ser Phe Asn Ala Asn Arg Asn Ser Asn Gly Gin Asn Ser Trp Leu Ala 
610 615 620 

ggt gta ggt ggg acc tta ctg gaa ggc cac aac ctg agt tat cac gta 1920 
Gly Val Gly Gly Thr Leu Leu Glu Gly His Asn Leu Ser Tyr His Val 
625 630 635 640 



1968 



aac tgg cag gee get tac ggt acg ctg ggg ggc ggg tat aac tac gac 2016 
Asn Trp Gin Ala Ala Tyr Gly Thr Leu Gly Gly Gly Tyr Asn Tyr Asp 
660 665 670 

cgc gat caa cat gac gtt aac tgg cag ctg tct ggc ggt gtg gtc ggg 2064 
Arg Asp Gin His Asp Val Asn Trp Gin Leu Ser Gly Gly Val Val Gly 
675 680 685 

cat gaa aat ggc ata acg ctg age cag cct tta ggg gat acc aat gtt 2112 
His Glu Asn Gly He Thr Leu Ser Gin Pro Leu Gly Asp Thr Asn Val 
690 695 700 

ttg att aaa gcg cct ggc gca ggc ggt gta cgc att gaa aat caa act 2160 
Leu He Lys Ala Pro Gly Ala Gly Gly Val Arg He Glu Asn Gin Thr 
705 710. 715 720 

ggc att tta acc gac tgg cgc ggc tat gcg gtg atg ctg tat gee acg 2208 
Gly lie Leu Thr Asp Trp Arg Gly Tyr Ala Val Met Leu Tyr Ala Thr 
725 730 735 

gtt tat egg tat aac cgt ate gcg ctt gat acc aat acg atg ggg aat 2256 
Val Tyr Arg Tyr Asn Arg He Ala Leu Asp Thr Asn Thr Met Gly Asn 
740 745 750 

tec ate gat gtt gaa aaa aat att age age gtt gtg ccg acg caa ggc 2304 
Ser He Asp Val Glu Lys Asn He Ser Ser Val Val Pro Thr Gin Gly 
755 760 765 
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gcg ttg gtt cgt gcc aat ttt gat acc cgc ata ggc gtg egg gcg etc 2352 

Ala Leu Val Arg Ala Asn Phe Asp Thr Arg lie Gly Val Arg Ala Leu 

770 775 780 

att acc gtt acc cag ggc gga aaa ccg gtg ccg ttt gga tea ctg gta 2400 

lie Thr Val Thr Gin Gly Gly Lys Pro Val Pro Phe Gly Ser Leu Val 

785 790 795 800 

egg gaa aac agt acc gga ata acc agt atg gtg ggt gat gac ggg caa 24 48 

Arg Glu Asn Ser Thr Gly lie Thr Ser Met Val Gly Asp Asp Gly Gin 

805 810 815 

gtt tat tta agt ggt gcg cca ttg tct ggt gaa tta ctg gtt cag tgg 24 96 

Val Tyr Leu Ser Gly Ala Pro Leu Ser Gly Glu Leu Leu Val Gin Trp 

820 825 830 

gga gac ggc gcg aac tea cgc tgc att gcg cac tat gta ttg ccg aag 2544 

Gly Asp Gly Ala Asn Ser Arg Cys lie Ala His Tyr Val Leu Pro Lys 

835 840 845 

caa age tta cag caa gcc gtc act gtt att teg gca gtt tgc aca cat 2592 

Gin Ser Leu Gin Gin Ala Val Thr Val He Ser Ala Val Cys Thr His 

850 855 860 

cct ggc tea taa 2604 

Pro Gly Ser * 

865 



<210> 187 
<211> 978 
<212> DNA 

<213> Escherichia coli 

<220> 

<221> CDS 

<222> (1) . . . (978) 

<400> 187 

atg gca tgt ttg tgt ctg gca aac ata tec tgg get act gtt tgt gca 48 

Met Ala Cys Leu Cys Leu Ala Asn lie Ser Trp Ala Thr Val Cys Ala 
15 10 15 

aat agt act ggc gta gca gaa gat gaa cac tat gat etc tea aat ate 96 
Asn Ser Thr Gly Val Ala Glu Asp Glu His Tyr Asp Leu Ser Asn lie 
20 25 30 

ttt aat age acc aat aac cag cca ggg cag att gtt gtt tta ccg gaa 14 4 

Phe Asn Ser Thr Asn Asn Gin Pro Gly Gin He Val Val Leu Pro Glu 
35 40 45 

aaa tec ggc tgg gta ggt gtc tea gca att tgt cca ccc ggt acg ctg 192 
Lys Ser Gly Trp Val Gly Val Ser Ala He Cys Pro Pro Gly Thr Leu 
50 55 60 

gtg aat tat aca tac cgt agt tat gtc acc aac ttt att gtt cag gaa 24 0 

Val Asn Tyr Thr Tyr Arg Ser Tyr Val Thr Asn Phe He Val Gin Glu 
65 70 75 80 
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act ate gat aat tat aaa tat atg caa tta cat gat tat eta tta ggt 288 
Thr lie Asp Asn Tyr Lys Tyr Met Gin Leu His Asp Tyr Leu Leu Gly 
85 ' 90 95 

gcg atg agt ctg gtt gat agt gtg atg gat att cag ttc ccc ccg caa 336 
Ala Met Ser Leu Val Asp Ser Val Met Asp lie Gin Phe Pro Pro Gin 
100 105 110 

aat tat att egg atg gga aca gat cct aac gtt teg caa aac ctt cca 384 
Asn Tyr lie Arg Met Gly Thr Asp Pro Asn Val Ser Gin Asn Leu Pro 
115 ^ 120 125 

ttc ggg gtg atg gat tct cgt tta ata ttt cgt tta aag gtt att cgt 432 
Phe Gly Val Met Asp Ser Arg Leu lie Phe Arg Leu Lys Val lie Arg 
130 135 140 

ccc ttt att aac atg gtg gag ate ccc aga cag gtg atg ttt ace gtg 4 80 

Pro Phe He Asn Met Val Glu Tie Pro Arg Gin Val Met Phe Thr Val 
145 150 155 160 

tat gtg aca tea acg cct tac gat ccg ttg gtt aca cct gtt tat ace 528 
Tyr Val Thr Ser Thr Pro Tyr Asp Pro Leu Val Thr Pro Val Tyr Thr 
165 " * 170 175 

att agt ttt ggt ggc egg gtt gaa gta ccg caa aac tgc gaa tta aat 576 
He Ser Phe Gly Gly Arg Val Glu Val Pro Gin Asn Cys Glu Leu Asn 
180 185 190 

gee ggg cag att gtt gaa ttt gat ttt ggt gat ate ggc gca teg tta 624 
Ala Gly Gin He Val Glu Phe Asp Phe Gly Asp He Gly Ala Ser Leu 
195 200 205 

ttt agt gcg gca ggg ccg ggt aat cga cct get ggt gtc atg ccg caa 672 
Phe Ser Ala Ala Gly Pro Gly Asn Arg Pro Ala Gly Val Met Pro Gin 
210 215 220 

acc aag age att gcg gtc aaa tgt acg aat gtt get gcg cag get tat 720 
Thr Lys Ser He Ala Val Lys Cys Thr Asn Val Ala Ala Gin Ala Tyr 
225 230 235 240 

tta aca atg cgt ctg gaa gee agt gee gtt tct ggt cag gcg atg gtg 7 68 

Leu Thr Met Arg Leu Glu Ala Ser Ala Val Ser Gly Gin Ala Met Val 
245 250 255 

teg gac aat cag gat tta ggt ttt att gtc gee gat cag aac gat acg 816 
Ser Asp Asn Gin Asp Leu Gly Phe He Val Ala Asp Gin Asn Asp Thr 
260 265 . 270 

ccg ate acg cct aac gat etc aat age gtt . att cct ttc cgt ctg gat 8 64 

Pro He Thr Pro Asn Asp Leu Asn Ser Val He Pro Phe Arg Leu Asp 
275 * 280 285 

gca get gcg gca gee aat gtc aca ctt cgc gee tgg cct ate agt att 912 
Ala Ala Ala Ala Ala Asn Val Thr Leu Arg Ala Trp. Pro He Ser lie 
290 295 300 

acc ggt caa aaa ccg acc gaa ggg ccg ttt age gcg ctg ggg tat tta 960 
Thr Gly Gin Lys Pro Thr Glu Gly Pro. Phe Ser Ala Leu Gly Tyr Leu 
305 310 315 320 
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cgc gtc gat tat caa tga 978 
Arg Val Asp Tyr Gin * 
325 



<210> 189 
<211> 516 
<212> DNA 

<213> Escherichia coli 

<220> 

<221> CDS 

<222> (1) . . . (516) 

<400> 188 

atg aga aga gta etc ttt age tgt ttc tgc ggg eta ctg tgg agt tec 48 

Met Arg Arg Val Leu Phe Ser Cys ' Phe Cys Gly Leu Leu Trp Ser Ser 
1 5 10 15 

agt gga tgg gca gtt gac cct tta gga acg att aat ate aat ttg cac 96 
Ser Gly Trp Ala Val Asp Pro Leu Gly Thr lie Asn lie Asn Leu His 
20 25 30 

ggt aac gtt gtt gat ttc tec tgt acc gta aac aca gcg gat att gat 144 
Gly Asn Val Val Asp Phe Ser Cys Thr Val Asn Thr Ala Asp lie Asp 
35 40 45 

aag acg gta gat tta ggc aga tgg cct acg aca caa eta ctg aac get 192 
Lys Thr Val Asp Leu Gly Arg Trp Pro Thr Thr Gin Leu Leu Asn Ala 
50 55 60 

ggc gat acc acg gca etc gtc cct ttt age ctg egg ctg gag gga tgt 240 
Gly Asp Thr Thr Ala Leu Val Pro Phe Ser Leu Arg Leu Glu Gly Cys 
65 70 75 80 

cct ccg ggt tea gtt gcg att tta ttt acg gga acg ccg gca tec gat 288 
Pro Pro Gly Ser Val Ala He Leu Phe Thr Gly Thr Pro Ala Ser Asp 
85 90 95 

acc aac ctg ctg get ctg gat gat ccc gca atg gca caa acc gtc gee 336 
Thr Asn Leu Leu Ala Leu Asp Asp Pro Ala Met Ala Gin Thr Val Ala 
100 105 110 

ate gaa tta cgt aat age gat cgc tec egg etc gca ctg ggg gag gcg 384 
He Glu Leu Arg Asn Ser Asp Arg Ser Arg Leu Ala Leu Gly Glu Ala 
115 " 120 125 

age ccg act gag gaa gta gat gca aat ggc aat gtc aca eta aac ttt 432 
Ser Pro Thr Glu Glu Val Asp Ala Asn Gly Asn Val Thr Leu Asn Phe 
130 135 140 

ttt gec aat tat cga gcg tta gec age ggt gtt egg cca ggt gtg gcg 4 80 

Phe Ala Asn Tyr Arg Ala Leu Ala Ser Gly Val Arg Pro Gly Val Ala 
145 150 155 160 

aaa gcg gat gcg ata ttt atg ate aat tat aat taa 516 
Lys Ala Asp Ala He Phe Met He Asn Tyr Asn * 
165 170 
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<210> 189 
<211> 585 
<212> DNA 

<213> Escherichia coli 



<220> 

<221> CDS 

<222> (1) . . . (585) 

<400> 189 

atg ttt ggc tat cgc agt aac gtg cca aaa gtg cgc tta acc aca gac 
Met Phe Gly Tyr Arg Ser Asn Val Pro Lys Val Arg Leu Thr Thr Asp 
15 10 15 



atg att aac gaa ttt cat aaa caa ggt tea get ttc tac ttt ggc tta 
Met lie Asn Glu Phe His Lys Gin Gly Ser Ala Phe Tyr Phe Gly Leu 
65 70 75 80 



48 



cga ctg gtc gtg cgt ctg gtg cat gat cgt gat gec tgg cgt ctt gcg 96 
Arg Leu Val Val Arg Leu Val His Asp Arg Asp Ala Trp Arg Leu Ala 
20 25 30 

gat tat tac gca gag aat cgc cat ttc etc aag ccc tgg gag cca gtg 14 4 

Asp Tyr Tyr Ala Glu Asn Arg His Phe Leu Lys Pro Trp Glu Pro Val 
35 40 45 

cgc gac gaa age cac tgt tat cca tea ggc tgg cag gec agg ctg ggg 192 
Arg Asp Glu Ser His Cys Tyr Pro Ser Gly Trp Gin Ala Arg Leu Gly 
50 55 60 



240 



ttc gac ccg gac gaa aaa gag att att ggc gtt gec aat ttt tec aat 288 
Phe Asp Pro Asp Glu Lys Glu lie lie Gly Val Ala Asn Phe Ser Asn 
85 90 95 

gtt gtt cgt ggc tct ttt cat gee tgc tat etc ggt tat teg att ggg 336 
Val Val Arg Gly Ser Phe His Ala Cys Tyr Leu Gly Tyr Ser He Gly 
100 105 110 

caa aaa tgg cag ggc aaa gga etc atg ttt gaa gec ctg acc gca gec 384 
Gin Lys Trp Gin Gly Lys Gly Leu Met Phe Glu Ala Leu Thr Ala Ala 
115 * 120 125 

att cgt tat atg cag cgc acc caa cat att cat cgc att atg get aat 4 32 

He Arg Tyr Met Gin Arg Thr Gin His He His Arg He Met Ala Asn 
130 135 140 

tat atg ccg cac aat aaa cgc age ggt gat tta ctg gcg cga ctg ggt 480 
Tyr Met Pro His Asn Lys Arg Ser Gly Asp Leu Leu Ala Arg Leu Gly 
145 150 .155 160 

ttt gaa aaa gaa ggc tat gcg aaa gac tat ctg ttg att gat gga caa 528 
Phe Glu Lys Glu Gly Tyr Ala Lys Asp Tyr Leu Leu He Asp Gly Gin 
165 170 175 

tgg cgc gat cac gta ctg acg gca tta act acc cca gac tgg acg ccc 576 
Trp Arg Asp His Val Leu Thr Ala Leu Thr Thr Pro Asp Trp Thr Pro 
180 185 190 

ggc cgc taa 585 
Gly Arg * 
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<210> 190 
<211> 648 
<212> DNA 

<213> Escherichia coli 

<220> 

<221> CDS 

<222> (1) . . . (648) 

<400> 190 

atg aaa tat caa ctt act gcc ctt gaa get cgc gtg att ggt tgc ttg 48 
Met Lys Tyr Gin Leu Thr Ala Leu Glu Ala Arg Val lie Gly Cys Leu 
15 10 15 

ctg gaa aag cag gtg acc acg ccg gag caa tac ccg etc tea gtc aat 96 
Leu Glu Lys Gin Val Thr Thr Pro Glu Gin Tyr Pro Leu Ser Val Asn 
20 25 30 

ggt gta gtc acg gcc tgt aat cag aaa acg aac cgt gaa ccg gtc atg 144 
Gly Val Val Thr Ala Cys Asn Gin Lys Thr Asn Arg Glu Pro Val Met 
35 40 4 5 

aat ctg agt gaa tec gaa gtg cag gaa cag ctg gat aat ctg gtc aaa 192 
Asn Leu Ser Glu Ser Glu Val Gin Glu Gin Leu Asp Asn Leu Val Lys 
50 55 60 

cgt cat tat eta cgc aca gtg age ggt ttt ggt aat egg gtc acc aaa 240 
Arg His Tyr Leu Arg Thr Val Ser Gly Phe Gly Asn Arg Val Thr Lys 
65 70 75 80 

tat gag caa cgt ttt tgt aat tea gaa ttt ggc gat ctg aaa ctg age 288 
Tyr Glu Gin Arg Phe Cys Asn Ser Glu Phe Gly Asp Leu Lys Leu Ser 
85 * 90 95 

gca gcg gaa gtg gcg tta ate acc acg ttg tta ttg cgt ggt gcc cag 336 
Ala Ala Glu Val Ala Leu lie Thr Thr Leu Leu Leu Arg Gly Ala Gin 
100 105 110 

acg cca ggt gaa ctg cgc age cgc gcc gcg cga atg tat gaa ttc age 384 
Thr Pro Gly Glu Leu Arg Ser Arg Ala Ala Arg Met Tyr Glu Phe Ser 
115 120 125 

gat atg gcg gaa gtg gag teg acg ctg gaa caa ctg gca aat cgc gaa 432 
Asp Met Ala Glu Val Glu Ser Thr Leu Glu Gin Leu Ala Asn Arg. Glu 
130 135 140 

gat ggt cct ttt gtg gtg cgt ctg gcc cgc gaa ccg ggt aaa cgc gaa 480 
Asp Gly Pro Phe Val Val Arg Leu Ala Arg Glu Pro Gly Lys Arg Glu 
145 ~ 150 155 160 

aac cgc tac atg cat ctt ttc agt ggt gag gtt gaa gat cag ccg gcg 528 
Asn Arg Tyr Met His Leu Phe Ser Gly Glu Val Glu Asp Gin Pro Ala 
165 170 ' 175 

gtg acg gat atg teg aac gcg gtt gac ggt gat tta cag gcc cgc gtc 57 6 

Val Thr Asp Met Ser Asn Ala Val Asp Gly Asp Leu Gin Ala Arg Val 
180 185 190 
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aaa aaa ctg acg ttg atg gtc ggt ttt aac cgt cgt ttc gca cca etc 
Lvs Lvs Leu Thr Leu Met Val Gly Phe Asn Arg Arg Phe Ala Pro Leu 
115 120 125 

tac ggt gag tta aaa acg caa etc gcc ace gca gcc teg eta aga atg 
Tyr Gly Glu Leu Lys Thr Gin Leu Ala Thr Ala Ala Ser Leu Arg Met 
130 135 140 

gat aaa cat cgt age aat age gtc ggg cca cac gat ctt tat ttc acg 
Asp Lys His Arg Ser Asn Ser Val Gly Pro His Asp Leu Tyr Phe Thr 
145 150 155 160 



624 



648 



gaa gcc ctg gaa ate gaa gtg gca gaa ctg aaa cag cgt ctt gat teg 
Glu Ala Leu Glu He Glu Val Ala Glu Leu Lys Gin Arg Leu Asp Ser 
195 200 205 

ttg ctg gcc cat ctg gga gat taa 
Leu Leu Ala His Leu Gly Asp * 
210 215 



<210> 191 

<211> 924 

<212> DNA 

<213> Escherichia coli 

<220> 
<221> CDS 
<222> (1) . . . (924) 

<400> 191 

gtg aaa aaa tta cgt ate ggc gta gtg gga tta ggt ggc att gcg caa 
Val Lys Lys Leu Arg He Gly Val Val Gly Leu Gly Gly lie Ala Gin 
I 5 10 15 

aaa gcg tgg tta ccg gtg ctg gcg gca gcg tct gac tgg acg tta caa 
Lvs Ala Trp Leu Pro Val Leu Ala Ala Ala Ser Asp Trp Thr Leu Gin 
20 25 30 

gga gcc tgg teg cct acg cgc gcg aaa gcc ctg cca att tgt gaa age 144 
Gly Ala Trp Ser Pro Thr Arg Ala Lys Ala Leu Pro He Cys Glu Ser 
35 40 45 

tgg cgc att cct tat gcc gat teg tta tec age ctt gcc gcc agt tgc 192 
■ TrD Are He Pro Tyr Ala Asp Ser Leu Ser Ser Leu Ala Ala Ser Cys 
50 55 60 

gat gcg gtt ttt gtg cat tec age ace gcc age cac ttt gac gtg gtc 
Asp Ala Val Phe Val His Ser Ser Thr Ala Ser His Phe Asp Val Val 
65 ™ 75 80 

agt acg tta etc aat gcg ggg gta cat gtc tgt gtc gat aaa ccg ctg 
Ser Thr Leu Leu Asn Ala Gly Val His Val Cys Val Asp Lys Pro Leu 
85 90 95 

gca gaa aat ctg cgc gat get gaa egg ctg gtg gaa ctg gcg gcg egg 336 
Ala Glu Asn Leu Arg Asp Ala Glu Arg Leu Val Glu Leu Ala Ala Arg 
100 105 HO 



48 



96 



240 



288 



384 



432 



480 
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ttg ctg gat gat tat ctg cat gtg gtg gat acc gcg ctg tgg ttg teg 
Leu Leu Asp Asp Tyr Leu His Val Val Asp Thr Ala Leu Trp Leu Ser 
165 170 175 



528 



ggc ggc aaa gcc tct ctg gat ggc ggt acg eta ctg act aac gac get 57 6 

Gly Gly Lys Ala Ser Leu Asp Gly Gly Thr Leu Leu Thr Asn Asp Ala 
180 185 190 



ggc gaa atg ctg ttt gcc gag cac cat ttt teg get ggt cct ttg cag 
Gly Glu Met Leu Phe Ala Glu His His Phe Ser Ala Gly Pro Leu Gin 
195 200 205 

ate acc acc tgt atg cat cgc cgt gcc gga agt cag cgt gaa acc gtg 
He Thr Thr Cys Met His Arg Arg Ala Gly Ser Gin Arg Glu Thr Val 
210 215 220 

cag gcc gtg act gac ggt gcg etc ate gac att acg gat atg cgc gaa 
Gin Ala Val Thr Asp Gly Ala Leu He Asp He Thr Asp Met Arg Glu 
225 230 235 240 

tgg cgt gag gag cgc ggg cag ggc gta gtg cat aaa ccg att cct ggt 
Trp Arg Glu Glu Arg Gly Gin Gly Val Val His Lys Pro He Pro Gly 
245 250 255 



ttc att gaa tgt gtg caa aac cag aca gtt ccg caa acc gcc ggc gaa 
Phe He Glu Cys Val Gin Asn Gin Thr Val Pro Gin Thr Ala Gly Glu 
275 280 285 

cag gcc gtg ctg gcg caa cgt ate gtt gac aag ate tgg cgc gat gcg 
Gin Ala Val Leu Ala Gin Arg He Val Asp Lys He Trp Arg Asp Ala 
290 295 300 

atg agt gaa taa 
Met Ser Glu * 
305 



<210> 192 
<211> 720 
<212> DNA 

<213> Escherichia coli 

<220> 

<221> CDS 

<222> (1) . . . (720) 

<400> 192 

atg tta aag cgc gtg ttc etc age ctg tta gtc ctg ate ggc ttg ctg 
Met Leu Lys Arg Val Phe Leu Ser Leu Leu Val Leu He Gly Leu Leu 
15 10 15 

ctg ttg act gtg etc ggc etc gat cgc tgg atg age tgg aaa acc gcg 
Leu Leu Thr Val Leu Gly Leu Asp Arg Trp Met Ser Trp Lys Thr Ala 
20 25 30 



624 



672 



720 



768 



tgg cag agt acg ctt gag caa cgt ggg ttt gtc ggc tgt gcg egg cac 816 
Trp Gin Ser Thr Leu Glu Gin Arg Gly Phe Val Gly Cys Ala Arg His 
260 265 270 



864 



912 



924 



48 



96 



cct tat ate tac gac gaa ttg cag gat etc ccc tac cgc cag gtc ggt 144 
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Pro Tyr He Tyr Asp Glu Leu Gin Asp Leu Pro Tyr Arg Gin Val Gly 
35 40 45 

gtg gtg etc gga aca gca aaa tat tat cgt act ggc gta att aat cag 
Val Val Leu Gly Thr Ala Lys Tyr Tyr Arg Thr Gly Val He Asn Gin 
50 55 60 

tat tat cgc tac cgc att caa gga gcg att aat gec tat aac age ggt 
Tvr Tyr Arg Tyr Arg He Gin Gly Ala lie Asn Ala Tyr Asn Ser Gly 
65 ' 70 75 80 

aag gta aat tat eta tta ctg age ggc gat aac gca ttg caa agt tat 
Lvs Val Asn Tyr Leu Leu Leu Ser Gly Asp Asn Ala Leu Gin Ser Tyr 
85 90 95 

aat gag ccg atg acc atg cgc aaa gat tta ate get get ggt gtc gac 
Asn Glu Pro Met Thr Met Arg Lys Asp Leu He Ala Ala Gly Val Asp 
100 105 HO 

cca tea gat att gtt etc gat tac gca ggc ttt cgt acg ctg gat tec 
Pro Ser Asp He Val Leu Asp Tyr Ala Gly Phe Arg Thr Leu Asp Ser 
115 120 125 

ate gtg cgt aca cgc aaa gtt ttc gat act aat gat ttc att att ate 
He Val Arg Thr Arg Lys Val Phe Asp Thr Asn Asp Phe He He He 
130 " 135 140 

acc caa cgt ttc cac tgt gag cga gca tta ttt att gcg ctg cat atg 
Thr Gin Arg Phe His Cys Glu Arg Ala Leu Phe He Ala Leu His Met 
145 150 . 155 160 

ggg att cag get cag tgt tat gee gta ccg tea ccg aaa gat atg ctg 
Gly He Gin Ala Gin Cys Tyr Ala Val Pro Ser Pro Lys Asp Met Leu 
165 170 175 



gtc aca ccc gaa cag tta ctt gaa tta caa aag aaa caa gga aag tag 
Val Thr Pro Glu Gin Leu Leu Glu Leu Gin Lys Lys Gin Gly Lys * 
225 230 235 



192 



240 



288 



336 



384 



432 



480 



528 



tea gta cgt att cgt gaa ttt gee gee cgt ttc ggt gcg ctg get gac 57 6 

Ser Val Arg He Arg Glu Phe Ala Ala Arg Phe Gly Ala Leu Ala Asp 

180 185 190 

ctt tat att ttt aaa cgt gaa ccg cgt ttt tta ggg ccg ctg gtc cct 624 

Leu Tyr He Phe Lys Arg Glu Pro Arg Phe Leu Gly Pro Leu Val Pro 

195 A 200 205 

att ccg get atg cac cag gta ccg gaa gat gcg cag ggg tat ccc gee 672 

He Pro Ala Met His Gin Val Pro Glu Asp Ala Gin Gly Tyr Pro Ala 

210 215 220 



720 



<210> 193 
<211> 240 
<212> DNA 

<213> Escherichia coli 

<220> 

<221> CDS 

<222> (1) . . . (240) 



-123- 



WO 01/34810 



PCT7USO0/30950 



<400> 193 

atg gat gta cag cag ttt ttt gtc gtt gcc gtt ttt ttc ctt ate ccg 48 

Met Asp Val Gin Gin Phe Phe Val Val Ala Val Phe Phe Leu He Pro 

15 10 15 

ata ttt tgt ttc cgc gaa gca tgg aaa ggc tgg cgc gca ggc gcg att 96 

He Phe Cys Phe Arg Glu Ala Trp Lys Gly Trp Arg Ala Gly Ala He 

20 25 30 

gat aaa egg gtt aaa aat gca ccg gaa ccg gtg tat gtc tgg cga gca 144 

Asp Lys Arg Val Lys Asn Ala Pro Glu Pro Val Tyr Val Trp Arg Ala 

35 40 45 

aaa aat ccc gga etc ttt ttc get tat atg gtg gca tat ate ggc ttc 192 

Lys Asn Pro Gly Leu Phe Phe Ala Tyr Met Val Ala Tyr He Gly Phe 

50 55 60 

gga att tta tct ate ggc atg att gtt tat ctt att ttc tat cgt taa 240 

Gly He Leu Ser He Gly Met He Val Tyr Leu He Phe Tyr Arg * 

65 70 75 



<210> 194 
<211> 1533 
<212> DNA 

<213> Escherichia coli 

<220> 

<221> CDS 

<222> (1) . . . (1533) 

<400> 194 

atg gcg acg ate gat tct atg aat aag gac acc aca cgt ttg age gat 48 

Met Ala Thr He Asp Ser Met Asn Lys Asp Thr Thr Arg Leu Ser Asp 

15 10 15 

gga ccc gac tgg acg ttc gac ctg ctg gat gtt tat ctg gca gag ata 96 
Gly Pro Asp Trp Thr Phe Asp Leu Leu Asp Val Tyr Leu Ala Glu He 
20 25 30 

gac egg gtg gcg aaa etc tac egg ctg gat acc tac ccg cac cag att 14 4 

Asp Arg Val Ala Lys Leu Tyr Arg Leu Asp Thr Tyr Pro His Gin He 
35 40 45 

gaa gtg ata acc tea gaa cag atg atg gat gcc tac tec age gtc ggc 192 
Glu Val He Thr Ser Glu Gin Met Met Asp Ala Tyr Ser Ser Val Gly 
50 55 60 

atg cca att aac tat ccg cac tgg tea ttc ggt aaa aag ttt ate gag 240 
Met Pro He Asn Tyr Pro His Trp Ser Phe Gly Lys Lys Phe He Glu 
65 70 75 80 

act gaa egg ctg tat aag cac ggt cag caa gga ctg gcc tat gaa ate 288 
Thr Glu Arg Leu Tyr Lys His Gly Gin Gin Gly Leu Ala Tyr Glu He 
85 90 95 

gtc att aac tct aac ccg tgt ate get tac ctg atg gaa gag aac acc 336 
Val He Asn Ser Asn Pro Cys He Ala Tyr Leu Met Glu Glu Asn Thr 
100 105 110 
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att acc atg caa gcg ctg gtg atg get cat gec tgc tat ggg cat aac 384 
He Thr Met Gin Ala Leu Val Met Ala His Ala Cys Tyr Gly His Asn 
115 120 125 

tct ttc ttc aaa aac aat tac tta ttc cgt age tgg acc gac gec agt 432 
Ser Phe Phe Lys Asn Asn Tyr Leu Phe Arg Ser Trp Thr Asp Ala Ser 
130 135 . 140 

teg att gtc gat tat ctg att ttt gec cgt aaa tat att acc gag tgc 480 
Ser He Val Asp Tyr Leu He Phe Ala Arg Lys Tyr He Thr Glu Cys 
145 150 155 160 

gaa gag cgt tat ggc gtt gat gaa gta gaa egg ctt ctg gac teg tgc 528 
Glu Glu Arg Tyr Gly Val Asp Glu Val Glu Arg Leu Leu Asp Ser Cys 
165 ' 170 175 

cac gcg ctg atg aac tac ggc gtg gac egg tac aaa cgc ccg caa aaa 576 
His Ala Leu Met Asn Tyr Gly Val Asp Arg Tyr Lys Arg Pro Gin Lys 
180 * - 185 190 

ate teg ctg caa gaa gag aaa gee egg cag aaa agt cgc gaa gag tat 624 
He Ser Leu Gin Glu Glu Lys Ala Arg Gin Lys Ser Arg Glu Glu Tyr 
195 200 205 

ctg caa agt cag gtc aat atg etc tgg cgt acc ctg ccg aag cgc gag 672 
Leu Gin Ser Gin Val Asn Met Leu Trp Arg Thr Leu Pro Lys Arg Glu 
210 215 220 

gaa gag aaa acg gtt get gaa gcg cgc cgc tat ccg tec gaa cca caa 720 
Glu Glu Lys Thr Val Ala Glu Ala Arg Arg Tyr Pro Ser Glu Pro Gin 
225 230 235 240 

gaa aac ctg etc tat ttt atg gag aaa aat gcg cca ctg ctg gaa tea 7 68 

Glu Asn Leu Leu Tyr Phe Met Glu Lys Asn Ala Pro Leu Leu Glu Ser 
245 250 255 

tgg cag cgt gaa ate ctg cgt att gtg cgt aag gtg age cag tat ttt 816 
Trp Gin Arg Glu He Leu Arg He Val Arg Lys Val Ser Gin Tyr Phe 
260 265 270 

tat ccg caa aaa cag act cag gtg atg aac gaa ggc tgg gcg acc ttc 864 
Tyr Pro Gin Lys Gin Thr Gin Val Met Asri Glu Gly Trp Ala Thr Phe 
275 280 285 

tgg cac tac acc ate ctt aac cat ctg tat gat gaa ggg aaa gta acg 912 
Trp His Tyr Thr He Leu Asn His Leu Tyr Asp Glu Gly Lys Val Thr 
290 295 300 



gaa cgt ttt atg ctg gag ttt ttg cac age cac acc aat gtg gtc ttc 
Glu Arg Phe Met Leu Glu Phe Leu His Ser His Thr Asn Val Val Phe 
305 310 315 320 



960 



cag ccc ccc tat aac age ccg tgg tac age ggc ate aac ccg tat gec 1008 
Gin Pro Pro Tyr Asn Ser Pro Trp Tyr Ser Gly He Asn Pro Tyr Ala 
325 330 335 

etc ggg ttc gec atg ttc cag gat att aaa egg att tgt cag teg cca 1056 
Leu Gly Phe Ala Met Phe Gin Asp He Lys Arg He Cys Gin Ser Pro 
340 345 350 
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acg gaa gaa gac aaa tac tgg ttc ccg gat ate gec ggt tec gac tgg 
Thr Glu Glu Asp Lys Tyr Trp Phe Pro Asp lie Ala Gly Ser Asp Trp 
355 360 365 



1104 



ctg gaa acg ctg cat ttc gcg atg cgt gat ttc aaa gat gag agt ttt 1152 
Leu Glu Thr Leu His Phe Ala Met Arg Asp Phe Lys Asp Glu Ser Phe 
370 375 380 



ate age cag ttc ctg tea ccg aaa gtg atg cgt gat ttc cgc ttc ttc 
lie Ser Gin Phe Leu Ser Pro Lys Val Met Arg Asp Phe Arg Phe Phe 
385 390 395 400 

ace gtg ctg gat gac gat egg cat aat tat ctg gag att tec get att 
Thr Val Leu Asp Asp Asp Arg His Asn Tyr Leu Glu lie Ser Ala He 
405 410 415 

cat aat gaa gaa ggt tat egg gag ate cgt aac egg tta teg teg caa 
His Asn Glu Glu Gly Tyr Arg Glu He Arg Asn Arg Leu Ser Ser Gin 
420 425 430 



gca ccg ctg gat egg ggg cgc aaa gaa gtc ctg aag cat gtg cat cgc 
Ala Pro Leu Asp Arg Gly Arg Lys Glu Val Leu Lys His Val His Arg 
465 470 475 480 



1200 



1248 



1296 



tat aac tta agt aat ctg gag ccg aat att cag ate tgg aac gtg gat 1344 
Tyr Asn Leu Ser Asn Leu Glu Pro Asn He Gin He Trp Asn Val Asp 
435 440 445 

ttg cgc ggc gac cgt teg ctg acg ctg cgt tat att cca cat aat cgc 1392 
Leu Arg Gly Asp Arg Ser Leu Thr Leu Arg Tyr He Pro His Asn Arg 
450 455 460 



1440 



ctg tgg gga ttt gat gtg atg etc gaa cag caa aac gaa gac ggc age 1488 
Leu Trp Gly Phe Asp Val Met Leu Glu Gin Gin Asn Glu Asp Gly Ser 
4 85 4 90 4 95 

ate gag ttq ctg gaa cgt tgc ccg cca aga atg gga aat ctg taa 1533 
He Glu Leu Leu Glu Arg Cys Pro Pro Arg Met Gly Asn Leu * 
500 505 510 



<210> 195 
<211> 1359 
<212> DNA 

<213> Escherichia coli 

<220> 

<221> CDS 

<222> (1) . . . (1539) 

<400> 195 

atg aaa aga eta tct ata acc gtc cgt tta acc ttg ctt ttt ata ttg 48 

Met Lys Arg Leu Ser He Thr Val Arg Leu Thr Leu Leu Phe He Leu 

15 10 15 

eta ctg tct gtt get ggc gec gga att gtc tgg act etc tat aat ggc 96 
Leu Leu Ser Val Ala Gly Ala Gly He Val Trp Thr Leu Tyr Asn Gly 
20 25 30 
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ctg gca agt gag ttg aaa tgg cgc gat gat aca aca etc att aac egg 144 
Leu Ala Ser Glu Leu Lys Trp Arg Asp Asp Thr Thr Leu He Asn Arg 
35 40 45 



aca gcg cag ate aag cag ttg tta att gat ggg gta aat cca gat acg 
Thr Ala Gin He Lys Gin Leu Leu He Asp Gly Val Asn Pro Asp Thr 
50 55 60 



gac gat etc get cat gaa ctt aga acg cca att aat gca tta ctg ggt 
Asp Asp Leu Ala His Glu Leu Arg Thr Pro He Asn Ala Leu Leu Gly 
245 250 255 



192 



384 



432 



tta cct gtg tac ttt aac egg atg atg gat gtt agt cag gat ate ttg 240 
Leu Pro Val Tyr Phe Asn Arg Met Met Asp Val Ser Gin Asp He Leu 
65 70 75 80 

att att cat ggt gat age ate aat aaa att gtt aac egg aca aat gtc 288 
He He His Gly Asp Ser He Asn Lys He Val Asn Arg Thr Asn Val 
85 90 95 

agt gat ggc atg tta aat aac ata cct get agt gag aca ate age gca 336 
Ser Asp Gly Met Leu Asn Asn He Pro Ala Ser Glu Thr He Ser Ala 
100 105 110 

get ggc att tac aga age att att aat gat aca gag ata gat get tta 
Ala Gly He Tyr Arg Ser He He Asn Asp Thr Glu He Asp Ala Leu 
115 120 125 

cga att aat att gat gaa gtt teg cca tea tta acg gtt act gtg get 
Arg He Asn He Asp Glu Val Ser Pro Ser Leu Thr Val Thr Val Ala 
130 135 140 

aaa ttg get tea gee aga cat aac atg ctt gaa cag tat aaa att aat 480 
Lys Leu Ala Ser Ala Arg His Asn Met Leu Glu Gin Tyr Lys He Asn 
145 150 155 160 

age att ata att tgc att gtc gee att gta ctt tgc tea gta tta agt 528 
Ser He He He Cys He Val Ala He Val Leu Cys Ser Val Leu Ser 
165 170 175 

ccg ctg tta ate aga acg gga tta cga gag ate aaa aag ttg agt ggt 576 
Pro Leu Leu He Arg Thr Gly Leu Arg Glu He Lys Lys Leu Ser Gly 
180 185 190 

gta acg gaa gcg ctg aat tat aac gat age cga gag cct gtt gag gtt 624 
Val Thr Glu Ala Leu Asn Tyr Asn Asp Ser Arg Glu Pro Val Glu Val 
195 200 205 

age gca tta ccg aga gaa eta aaa cct ctt ggg cag gcg ttg aat aaa 672 
Ser Ala Leu Pro Arg Glu Leu Lys Pro Leu Gly Gin Ala Leu Asn Lys 
210 215 220 

atg cat cat get tta gtc aaa gat ttt gag cgt eta agt cag ttt get 720 
Met His His Ala Leu Val Lys Asp Phe Glu Arg Leu Ser Gin Phe Ala 
225 230 235 240 



768 



cag aat cag gtt acg etc agt caa acc aga agt ate get gaa tat caa 816 
Gin Asn Gin Val Thr Leu Ser Gin Thr Arg Ser He Ala Glu Tyr Gin 
260 265 270 
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aaa aca att gcc gga aac att gaa gag ctg gaa aat att teg egg tta 8 64 

Lys Thr lie Ala Gly Asn lie Glu Glu Leu Glu Asn lie Ser Arg Leu 

275 280 285 

aca gag aac ata ctg ttt ctt gcc agg gca gat aaa aac aat gtt ttg 912 

Thr Glu Asn lie Leu Phe Leu Ala Arg Ala Asp Lys Asn Asn Val Leu 

290 295 300 

gtg aaa ctg gac teg ctt tct etc aat aag gaa gtc gaa aat ttg ttg 960 

Val Lys Leu Asp Ser Leu Ser Leu Asn Lys Glu Val Glu Asn Leu Leu 

305 310 315 320 

gac tat ctt gaa tac ctt tea gac gag aaa gag att tgc ttt aag gtc 1008 

Asp Tyr Leu Glu Tyr Leu Ser Asp Glu Lys Glu lie Cys Phe Lys Val 
325 330 335 

gag tgc aat cag caa ate ttt gcg gat aaa att tta eta caa cga atg 1056 

Glu Cys Asn Gin Gin lie Phe Ala Asp Lys lie Leu Leu Gin Arg Met 

340 345 350 

tta teg aat ctt att gtt aat gcc att aga tat teg cca gaa aaa teg 1104 

Leu Ser Asn Leu lie Val Asn Ala lie Arg Tyr Ser Pro Glu Lys Ser 

355 360 365 

cgt att cat ata acc agt ttt ctt gat acc aac age tat ctt aat att 1152 

Arg He His He Thr Ser Phe Leu Asp Thr Asn Ser Tyr Leu Asn He 

370 375 380 

gat ate gcc age cct gga acg aaa att aat gag cct gaa aaa etc ttc 1200 

Asp He Ala Ser Pro Gly Thr Lys He Asn Glu Pro Glu Lys Leu Phe 

385 390 395 400 

cgt aga ttt tgg egg gga gat aat teg cgt cat tec gta ggt cag gga 1248 

Arg Arg Phe Trp Arg Gly Asp Asn Ser Arg His Ser Val Gly Gin Gly 
405 410 415 

eta ggc ctt tct tta gtc aaa gcg att gcc gaa ttg cat ggg gga agt 1296 

Leu Gly Leu Ser Leu Val Lys Ala He Ala Glu Leu His Gly Gly Ser 

420 425 430 

get acg tat cac tat etc aat aag cat aat gtg ttc egg att acg tta 1344 

Ala Thr Tyr His Tyr Leu Asn Lys His Asn Val Phe Arg He Thr Leu 

435 440 445 

ccg caa aga aat taa 1359 
Pro Gin Arg Asn * 
450 



<210> 196 
<211> 720 
<212> DNA 

<213> Escherichia coli 

<220> 

<221> CDS 

<222> (1) . . . (720) 

<400> 196 

atg aac cag get gtt tct ata aca tat gat tta tgg cat att att ttc 48 
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Met Asn Gin Ala Val Ser He Thr Tyr Asp Leu Trp His lie He Phe 

15 10 15 

atg aag att eta ctt att gaa gat aat caa agg acc cag gaa tgg gta 96 

Met Lys lie Leu Leu He Glu Asp Asn Gin Arg Thr Gin Glu Trp Val 

20 25 30 

acg cag ggg ctt tec gaa gcg ggt tat gtc ate gat gee gtt tct gat 144 

Thr Gin Gly Leu Ser Glu Ala Gly Tyr Val He Asp Ala Val Ser Asp 

35 .40 45 

ggc aga gat ggg ctt tat ctt gcg ctg aag gat gat tat gca ttg ate 192 

Gly Arg Asp Gly Leu Tyr Leu Ala Leu Lys Asp Asp Tyr Ala Leu lie 

50 55 60 



att ctg gat att atg ctt ccg ggt atg gat ggc tgg cag ate tta caa 
He Leu Asp He Met Leu Pro Gly Met Asp Gly Trp Gin He Leu Gin 
65 70 75 80 



gca caa tta agg caa cat cac get ttg aat tea aca tta gaa ate age 
Ala Gin Leu Arg Gin His His Ala Leu Asn Ser Thr Leu Glu He Ser 
130 135 140 



agt att aca ctg acg cgc aag gag ttt cag tta ctt tgg eta ctg gee 
Ser He Thr Leu Thr Arg Lys Glu Phe Gin Leu Leu Trp Leu Leu Ala 
165 170 175 



cgc agg etc cgc gca aaa gtt gat gat cct ttt cct gaa aag eta att 
Arg Arg Leu Arg Ala Lys Val Asp Asp Pro Phe Pro Glu Lys Leu He 
210 215 220 



240 



acg tta aga aca gca aag caa acc cct gtt att tgc ctt act gca agg 288 
Thr Leu Arg Thr Ala Lys Gin Thr Pro Val He Cys Leu Thr Ala Arg 
85 < 90 95 

gat tct gtc gat gac aga gtc aga ggg ctg gac agt ggg gca aat gat 336 
Asp Ser Val Asp Asp Arg Val Arg Gly Leu Asp Ser Gly Ala Asn Asp 
100 105 110 

tat ctg gta aaa cct ttt tea ttt tct gag ttg ctg gca agg gtt egg 384 
Tyr Leu Val Lys Pro Phe Ser Phe Ser Glu Leu Leu Ala Arg Val Arg 
115 120 125 



432 



ggc tta aga atg gac tct gtt agt cat agt gtg age agg gac aat ate 4 80 

Gly Leu Arg Met Asp Ser Val Ser His Ser Val Ser Arg Asp Asn He 
145 150 155 160 



528 



tec aga get ggc gaa att ata ccc aga acg gtt att gcg agt gaa att 576 
Ser Arg Ala Gly Glu He He Pro Arg Thr Val He Ala Ser Glu He 
180 185 190 

tgg gga ate aac ttt gat agt gat acc aat acg gtg gac gtc gee att 624 
Trp Gly He Asn Phe Asp Ser Asp Thr Asn Thr Val Asp Val Ala lie 
195 200 205 



672 



gee aca ate egg ggg atg ggc tat tea ttc gta gcg gta aaa aaa taa 720 
Ala Thr He Arg Gly Met Gly Tyr Ser Phe Val Ala Val Lys Lys * 
225 ' 230 235 
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<210> 197 
<211> 519 
<212> DNA 

<213> Escherichia coli 

<220> 

<221> CDS 

<222> (I)... (519) 

<400> 197 

atg att ctt aaa tea get att tec get gac tct ctg etc get aag gat 48 

Met lie Leu Lys Ser Ala lie Ser Ala Asp Ser Leu Leu Ala Lys Asp 

15 10 15 

get ttt agg gca tec ttt cat tta cac ttt tta cga aat cat ggg ate 96 
Ala Phe Arg Ala Ser Phe His Leu His Phe Leu Arg Asn His Gly lie 
20 25 30 

act aac aaa ata teg ctt gtc agt tat att gta tgg cag gaa aga tat 144 
Thr Asn Lys lie Ser Leu Val Ser Tyr lie Val Trp Gin Glu Arg Tyr 
35 40 45 

gcg act gat att aca gat ccc caa agt gga gag ttt atg acc att aaa 192 
Ala Thr Asp lie Thr Asp Pro Gin Ser Gly Glu Phe Met Thr lie Lys 
50 55 60 

aat aag atg ttg ctg ggt gcg ctt ttg ctg gtt acc agt gee gee tgg 240 
Asn Lys Met Leu Leu Gly Ala Leu Leu Leu Val Thr Ser Ala Ala Trp 
65 70 75 80 

gee gca cca gee acc gcg ggt teg acc aat acc teg gga att tct aag 288 
Ala Ala Pro Ala Thr Ala Gly Ser Thr Asn Thr Ser Gly lie Ser Lys 
85 90 95 

tat gag tta agt agt ttc att get gac ttt aag cat ttc aaa cca ggg 336 
Tyr Glu Leu Ser Ser Phe lie Ala Asp Phe Lys His Phe Lys Pro Gly 
100 105 110 

gac acc gta cca gaa atg tac cgt acc gat gag tac aac att aag cag 384 
Asp Thr Val Pro Glu Met Tyr Arg Thr Asp Glu Tyr Asn lie Lys Gin 
115 120 125 

tgg cag ttg cgt aac ctg ccc gcg cct gat gee ggg acg cac tgg acc 432 
Trp Gin Leu Arg Asn Leu Pro Ala Pro Asp Ala Gly Thr His Trp Thr 
130 135 140 

tat atg ggt ggc gcg tac gtg ttg ate age gac acc gac ggt aaa ate 480 
Tyr Met Gly Gly Ala Tyr Val Leu lie Ser Asp Thr Asp Gly Lys lie 
145 150 155 160 

att aaa gee tac gac ggt gag att ttt tat cat cgc taa 519 
lie Lys Ala Tyr Asp Gly Glu He Phe Tyr His Arg * 
165 170 



<210> 198 
<211> 825 
<212> DNA 

<213> Escherichia coli 
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<220> 

<221> CDS 

<222> (1) . . . (825) 

<400> 198 

atg acc gaa ttt aca act ctt ctt cag caa gga aac gcc tgg ttc ttc 48 
Met Thr Glu Phe Thr Thr Leu Leu Gin Gin Gly Asn Ala Trp Phe Phe 
15 10 15 



ate ccc age gcc ate tta ctt ggt gcg ctt cat ggc ctg gaa cca ggg 
He Pro Ser Ala He Leu Leu Gly Ala Leu His Gly Leu Glu Pro Gly 
20 25 30 

cac tea aaa acg atg atg gcg gcg ttt ate ate gcc ate aaa ggc acc 
His Ser Lys Thr Met Met Ala Ala Phe He He Ala He Lys Gly Thr 
35 40 45 

att aaa caa gcg gtg atg etc gga ctg gca gca act att teg cat acc 
He Lys Gin Ala Val Met Leu Gly Leu Ala Ala Thr He Ser His Thr 
50 55 60 

gca gtg gtc tgg tta att gcc ttt ggc ggg atg gtg ate age aag cgc 
Ala Val Val Trp Leu He Ala Phe Gly Gly Met Val He Ser Lys Arg 
65 70 75 80 

ttt act get caa tea gca gaa ccg tgg etc cag ctg att tec gca gtg 
Phe Thr Ala Gin Ser Ala Glu Pro Trp Leu Gin Leu He Ser Ala Val 
85 90 95 

ate att att age acc gcg ttc tgg atg ttc tgg cgt acc tgg cgc ggc 
He He He Ser Thr Ala Phe Trp Met Phe Trp Arg Thr Trp Arg Gly 
100 105 HO 

gaa cgc aac tgg ctg gag aat atg cac ggg cat gat tat gag cat cat 
Glu Arg Asn Trp Leu Glu Asn Met His Gly His Asp Tyr Glu His His 
115 120 125 

cat cac gat cac gaa cat cac cac gac cat gga cat cat cac cat cac 
His His Asp His Glu His His His Asp His Gly His His His His His 
130 135 140 

gaa cat ggc gag tat cag gat gcc cat gca cga gcc cat gcc aat gac 
Glu His Gly Glu Tyr Gin Asp Ala His Ala Arg Ala His Ala Asn Asp 
145 150 155 160 

att aaa cga cgc ttt gat ggt aga gag gtc acc aac tgg caa att ttg 
He Lys Arg Arg Phe Asp Gly Arg Glu Val Thr Asn Trp Gin He Leu 
165 170 175 



96 



144 



192 



240 



288 



336 



384 



432 



480 



528 



tta ttt ggc tta acc ggt ggc ctt ate ccc tgc ccg gca gca att acc 576 
Leu Phe Gly Leu Thr Gly Gly Leu He Pro Cys Pro Ala Ala He Thr 
180 185 190 

gtg ctg ttg ata tgc att cag ttg aaa gcc ctg aca ctg ggc gca aca 624 
Val Leu Leu He Cys He Gin Leu Lys Ala Leu Thr Leu Gly Ala Thr 
195 200 205 

ctg gtc gtc agt ttc age att ggc ctg gcg tta acg ctt gtc acc gta 672 
Leu Val Val Ser Phe Ser He Gly Leu Ala Leu Thr Leu Val Thr Val 
210 215 220 
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ggc gtt ggc gca gca ate age gtt cag 
Gly Val Gly Ala Ala He Ser Val Gin 
225 230 

gga ttt aac act etc get aaa cgc gec 
Gly Phe Asn Thr Leu Ala Lys Arg Ala 
245 

att ggc tta gtc ggt gtg tat atg ggc 
He Gly Leu Val Gly Val Tyr Met Gly 
260 265 

atg cga taa 
Met Arg * 



cag gtc gca aaa cgc tgg age 720 
Gin Val Ala Lys Arg Trp Ser 
235 240 

ccc tat ttt tec agt ctg ttg 768 
Pro Tyr Phe Ser Ser Leu Leu 
250 255 

gta cat ggc ttc atg ggc ata 816 
Val His Gly Phe Met Gly He 
270 

825 



<210> 199 
<211> 249 
<212> DNA 

<213> Escherichia coli 

<220> 

<221> CDS 

<222> (1) . . - (249) 

<400> 199 

atg tgt att ggc gtt cca ggc cag gtg ctg get gtc ggt gaa gat att 48 

Met Cys He Gly Val Pro Gly Gin Val Leu Ala Val Gly Glu Asp He 
15 10 15 

cac cag ctt gcg cag gtt gaa gta tgt ggt ate aag cgc gat gtg aat 96 
His Gin Leu Ala Gin Val Glu Val Cys Gly He Lys Arg Asp Val Asn 
20 25 30 



ate gec ctg att tgt gaa ggt aac cct gec gat eta ctg ggc cag tgg 
He Ala Leu He Cys Glu Gly Asn Pro Ala Asp Leu Leu Gly Gin Trp 
35 40 45 



gec aaa gec aca tta gac gca ctg cgc caa atg gat tac gac att acc 
Ala Lys Ala Thr Leu Asp Ala Leu Arg Gin Met Asp Tyr Asp He Thr 
65 70 75 80 



144 



gtg ctg gta cac gtc gga ttt gec atg age ate ate gac gaa gat gaa 192 
Val Leu Val His Val Gly Phe Ala Met Ser lie lie Asp Glu Asp Glu 
50 * 55 60 



240 



age gcg tga 249 
Ser Ala * 



<210> 200 
<211> 342 
<212> DNA 

<213> Escherichia coli 

<220> 
<221> CDS 
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<222> (1) . . . (342) 
<400> 200 

atg cat gag ttg tct ctt tgc cag age gec gtt gaa att ate caa egg 48 
Met His Glu Leu Ser Leu Cys Gin Ser Ala Val Glu lie lie Gin Arg 
15 10 15 

cag gcg gag cag cac gat gtt aag cgc gtc ace gee gtg tgg ctg gaa 96 
Gin Ala Glu Gin His Asp Val Lys Arg Val Thr Ala Val Trp Leu Glu 
20 25 30 

att ggc gcg etc tec tgc gtt gag gag age gee gtc cgt ttt agt ttt 14 4 

He Gly Ala Leu Ser Cys Val Glu Glu Ser Ala Val Arg Phe Ser Phe 
35 40 45 

gaa att gtc tgc cac gga acg gtg gcg caa ggg tgc gat tta cat ate 192 
Glu He Val Cys His Gly Thr Val Ala Gin Gly Cys Asp Leu His He 
50 55 60 

gtc tat aaa ccc gee cag get tgg tgc tgg gat tgc age cag gtg gtg 240 
Val Tyr Lys Pro Ala Gin Ala Trp Cys Trp Asp Cys Ser Gin Val Val 
65 " 70 75 80 

gag att cat cag cac gat gcg cag tgt ccg etc tgt cac ggc gag egg 288 
Glu He His Gin His Asp Ala Gin Cys Pro Leu Cys His Gly Glu Arg 
85 90 95 

ttg cgt gtc gat acc ggc gat teg ctg ate gtc aaa agt att gaa gtt 336 
Leu Arg Val Asp Thr Gly Asp Ser Leu He Val Lys Ser He Glu Val 
100 105 110 

gaa taa 342 
Glu * 



<210> 201 
<211> 489 
<212> DNA 

<213> Escherichia coli 

<220> 

<221> CDS 

<222> (1) . . . (489) 

<400> 201 

atg act gaa gag ata gca ggt ttc cag acc tec ccg aag gcg caa gta 
Met Thr Glu Glu He Ala Gly Phe Gin Thr Ser Pro Lys Ala Gin Val 
15 10 15 



48 



cag gca gcg ttt gaa gaa att gee egg cgt teg atg cac gat etc tct 96 

Gin Ala Ala Phe Glu Glu He Ala Arg Arg Ser Met His Asp Leu Ser 
20 25 30 

ttt ctg cat cct tea atg ccg gtg tat gtt tct gat ttt acg ctg ttc 144 

Phe Leu His Pro Ser Met Pro Val Tyr Val Ser Asp Phe Thr Leu Phe 
35 40 45 

gaa ggt cag tgg acg ggg tgt gtg ate acc ccg tgg atg ctg agt gca 192 

Glu Gly Gin Trp Thr Gly Cys Val He Thr Pro Trp Met Leu Ser Ala 
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50 55 60 

gtt ate ttc ccc ggc ccg gat caa etc tgg ccg ctg cgc aaa gtg agt 240 
Val lie Phe Pro Gly Pro Asp Gin Leu Trp Pro Leu Arg Lys Val Ser 
65 70 75 80 

gaa aaa att ggt ctg caa ctg ccg tat ggc act atg acc ttt acc gtt 288 
Glu Lys lie Gly Leu Gin Leu Pro Tyr Gly Thr Met Thr Phe Thr Val 
85 90 95. 

ggc gaa ctg gac ggt gtt teg caa tat etc tec tgt teg ctg atg teg 336 
Gly Glu Leu Asp Gly Val Ser Gin Tyr Leu Ser Cys Ser Leu Met Ser 
100 105 110 

ccg ctt teg cac age atg teg att gaa gag ggc caa cgc ctg acg gat 384 
Pro Leu Ser His Ser Met Ser lie Glu Glu Gly Gin Arg Leu Thr Asp 
115 120 125 

gac tgc gca cga atg ate ctt teg ctg cca gtc acg aat ccg gat gta 4 32 

Asp Cys Ala Arg Met lie Leu Ser Leu Pro Val Thr Asn Pro Asp Val 
130 135 140 

cca cac gca ggg cgt cgc gec ctg ctg ttt ggt cgc agg agt ggc gaa 480 
Pro His Ala Gly Arg Arg Ala Leu Leu Phe Gly Arg Arg Ser Gly Glu 
145 1 150 155 160 

aat gca tga 409 
Asn Ala * 



<210> 202 
<211> 495 
<212> DNA 

<213> Escherichia coli 

<220> 

<221> CDS 

<222> (1) . . . (495) 

<400> 202 

atg cgt att tta gtc tta ggg gtc ggc aat att ttg ctg acc gat gaa 48 
Met Arg lie Leu Val Leu Gly Val Gly Asn lie Leu Leu Thr Asp Glu- 
15 10 15 

gee ate ggt gtg egg att gtc gaa gcg tta gag caa cga tac att ctg 96 
Ala lie Gly Val Arg lie Val Glu Ala Leu Glu Gin Arg Tyr He Leu 
20 25 30 

ccg gat tat gtt gag ate etc gat ggc ggc acg gcg gga atg gag ctg 144 
Pro Asp Tyr Val Glu He Leu Asp Gly Gly Thr Ala Gly Met Glu Leu 
35 40 45 

ctt ggc gac atg gca aat cgc gat cat ttg att att gcg gat gee att 192 
Leu Gly Asp Met Ala Asn Arg Asp His Leu He He Ala Asp Ala He 
50 55 60 

gtc teg aaa aag aac gcg ccg gga acg atg atg ate ctg egg gat gaa 240 
Val Ser Lys Lys Asn Ala Pro Gly Thr Met Met He Leu Arg Asp Glu 
65 70 75 80 



-134- 



WO 01/34810 



PCTYUS00/30950 



gaa gtt ccg gcg ttg ttt acc aac aaa ate tct ccg cat cag ctt ggc 288 
Glu Val Pro Ala Leu Phe Thr Asn Lys lie Ser Pro His Gin Leu Gly 
85 90 95 

ctg gec gac gtc ttg teg gec ctg cgc ttc acc ggc gag ttt ccg aaa 336 
Leu Ala Asp Val Leu Ser Ala Leu Arg Phe Thr Gly Glu Phe Pro Lys 
100 105 110 



, aag ctg acc ctg gtc ggc gtg ate ccg gaa teg ctg gag cca cac ate 

Lys Leu Thr Leu Val Gly Val lie Pro Glu Ser Leu Glu Pro His He 

115 120 125 

ggc tta acg ccg acg gtt gaa gca atg att gaa cct gcg ctt gag cag 

Gly Leu Thr Pro Thr Val Glu Ala Met He Glu Pro Ala Leu Glu Gin 

130 135 140 



gcg att cat gac tga 
Ala He His Asp * 



384 



432 



gtt ctg get gcg ctg cgt gaa tct ggc gtg gaa gec ate cca egg gag 4 80 

Val Leu Ala Ala Leu Arg Glu Ser Gly Val Glu Ala lie Pro Arg Glu 
145 150 155 160 



495 



<210> 203 
<211> 1704 
<212> DNA 

<213> Escherichia coli 



<220> 

<221> CDS 

<222> (1) . . . (1704) 

<400> 203 

atg age cag aga att act att gat ccg gta acc cgt att gag ggg cat 48 
Met Ser Gin Arg He Thr He Asp Pro Val Thr Arg He Glu Gly His 
15 10 15 

tta cgc ate gat tgc gaa ate gaa aat ggc gtc gtt teg aaa gca tgg 96 
Leu Arg He Asp Cys Glu He Glu Asn Gly Val Val Ser Lys Ala Trp 
20 25 30 

get tec ggt acc atg tgg cgc ggc atg gaa gag ate gtg aaa aac cgc 144 
Ala Ser Gly Thr Met Trp Arg Gly Met Glu Glu He Val Lys Asn Arg 
35 40 45 

gat ccg cgc gat gca tgg atg att gtg caa cgt ate tgt ggc gta tgt 192 
Asp Pro Arg Asp Ala Trp Met He Val Gin Arg He Cys Gly Val Cys 
50 55 60 

act acc act cac gcg ctg tct tec gtt cgt gcg gca gaa agt gcg ctg 240 
Thr Thr. Thr His Ala Leu Ser Ser Val Arg Ala Ala Glu Ser Ala Leu 
65 70 75 80 

aat ate gac gtt ccg gtt aac gcg caa tac ate cgt aac ate att ctg 288 
Asn He Asp Val Pro Val Asn Ala Gin Tyr He Arg Asn He He Leu 
85 90 95 
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atg tac ate aag tct ttc ate gac aaa ctg age gac ttt gtt gag cag 
Met Tyr He Lys Ser Phe He Asp Lys Leu Ser Asp Phe Val Glu Gin 
245 250 255 



ctg aca cgc ggt aaa ggt gcg gtg aac tac ctg age gtg ccg gaa ttc 
Leu Thr Arg Gly Lys Gly Ala Val Asn Tyr Leu Ser Val Pro Glu Phe 
275 280 285 



gag aat gcg gat ctg tec teg tat cgt ccg ate act tct cat tec gat 
Glu Asn Ala Asp Leu Ser Ser Tyr Arg Pro He Thr Ser His Ser Asp 
305 310 315 320 

gaa tac ctg ate aaa ggc att cag gaa age gcg aag cac tec tgg tat 
Glu Tyr Leu He Lys Gly He Gin Glu Ser Ala Lys His Ser Trp Tyr 
325 330 335 



432 



480 



528 



get gcg cac ace acg cat gac cat att gtt cat ttc tat cag ctt teg 336 
Ala Ala His Thr Thr His Asp His He Val His Phe Tyr Gin Leu Ser 
100 105 110 

gcg ctg gac tgg gtg gac ate act tct gca ctg caa get gac cca acc 384 
Ala Leu Asp Trp Val Asp He Thr Ser Ala Leu Gin Ala Asp Pro Thr 
115 120 125 

aaa gee tec gaa atg ctg aaa ggc gtt teg acc tgg cac ctg aac agt 
Lys Ala Ser Glu Met Leu Lys Gly Val Ser Thr Trp His Leu Asn Ser 
130 135 140 

ccg gaa gag ttc acc aaa gtt cag aac aag ate aaa gat ctg gtt gee 
Pro Glu Glu Phe Thr Lys Val Gin Asn Lys He Lys Asp Leu Val Ala 
145 150 155 160 

age ggt cag ttg ggt att ttc get aat ggc tac tgg ggt cat ccg gcg 
Ser Gly Gin Leu Gly He Phe Ala Asn Gly Tyr Trp Gly His Pro Ala 
165 170 175 

atg aaa ctg ccg ccg gaa gtg aac ctg att gcg gta gcg cac tac ctg 57 6 

Met Lys Leu Pro Pro Glu Val Asn Leu He Ala Val Ala His Tyr Leu 
180 185 190 

caa gcg ttg gag tgc cag cgt gac get aac cgc gtc gtg gcg ctg ctg 624 
Gin Ala Leu Glu Cys Gin Arg Asp Ala Asn Arg Val Val Ala Leu Leu 
195 * 200 205 

ggc ggt aaa acg ccg cac att cag aac ctg gcg gta ggt ggt gtc gcg 672 
Gly Gly Lys Thr Pro His He Gin Asn Leu Ala Val Gly Gly Val Ala 
210 215 220 

aac ccg ate aac etc gac ggt ttg ggc gtg ctg aac ctt gag cgc ctg 720 
Asn Pro He Asn Leu Asp Gly Leu Gly Val Leu Asn Leu Glu Arg Leu 
225 230 235 240 



768 



gtt tat aag gtt gat acc gca gtt att gee gcg ttc tac ccg gaa tgg 816 
Val Tyr Lys Val Asp Thr Ala Val He Ala Ala Phe Tyr Pro Glu Trp 
260 265 270 



864 



ccg acc gac agt aaa aac ggc age ttc ctg ttc ccg ggc ggc tac att 912 
Pro Thr Asp Ser Lys Asn Gly Ser Phe Leu Phe Pro Gly Gly Tyr He 
290 295 300 



960 



1008 
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aaa gac gaa gcg ccg cag gca ccg tgg gaa ggc acc acc att ccg get 
Lys Asp Glu Ala Pro Gin Ala Pro Trp Glu Gly Thr Thr He Pro Ala 
340 345 350 

tat gat ggt tgg tct gac gac ggg aaa tat tec tgg gtg aaa tea ccg 
Tyr Asp Gly Trp Ser Asp Asp Gly Lys Tyr Ser Trp Val Lys Ser Pro 
355 360 365 



cag ctg cac tec acg ctg ggc cgt att att ggt cgt acc gtt cac tgc 
Gin Leu His Ser Thr Leu Gly Arg He He Gly Arg Thr Val His Cys 
420 425 430 



ccg tgc atg gec tgt gcg gta cac gta gtg gat get gac ggc aac gaa 
Pro Cys Met Ala Cys Ala Val His Val Val Asp Ala Asp Gly Asn Glu 
545 550 555 560 

gtg gtt tea gtg aag gtt ctg taa 
Val Val Ser Val Lys Val Leu *. 
565 



1056 



1104 



act ttc tac ggc aaa acg gta gaa gtg ggg cca ctg get aat atg ctg 1152 
Thr Phe Tyr Gly Lys Thr Val Glu Val Gly Pro Leu Ala Asn Met Leu 
370 " 375 380 

gtg aaa ctg gcg gca ggt cgc gaa tct acc cag aac aaa ctg aat gaa 1200 
Val Lys Leu Ala Ala Gly Arg Glu Ser Thr Gin Asn Lys Leu Asn Glu 
385 390 395 400 

ate gtt gcg att tat cag aaa ctg act ggc aac acg ctg gaa gtg gca 1248 
He Val Ala He Tyr Gin Lys Leu Thr Gly Asn Thr Leu Glu Val Ala 
405 410 415 



1296 



tgt gaa ttg cag gat ate ctg caa aac caa tac agt gca ctg ate acc 1344 
Cys Glu Leu Gin Asp He Leu Gin Asn Gin Tyr Ser Ala Leu He Thr 
435 440 445 

aat ate ggc aaa ggc gat cac acc acc ttt gtg aag ccg aac att ccg 1392 
Asn He Gly Lys Gly Asp His Thr Thr Phe Val Lys Pro Asn He Pro 
450 455 460 



1440 



gca acg ggt gaa ttc aaa ggt gtt ggc ttc etc gaa gcg ccg cgc ggt 

Ala Thr Gly Glu Phe Lys Gly Val Gly Phe Leu Glu Ala Pro Arg Gly 
465 470 475 480 

atg etc tct cac tgg atg gtt att aaa gac ggt ate ate age aac tac 

Met Leu Ser His Trp Met Val He Lys Asp Gly He He Ser Asn Tyr 
485 490 495 

cag gcg gtt gtt cca tea acc tgg aac tct ggt ccg cgt aac ttc aat 

Gin Ala Val Val Pro Ser Thr Trp Asn Ser Gly Pro Arg Asn Phe Asn 
500 505 510 

gat gac gtc ggt cct tac gag cag teg ctg gtg ggt aca ccg gtt gec 
Asp Asp Val Gly Pro Tyr Glu Gin Ser Leu Val Gly Thr Pro Val Ala 
515 520 525 

gat ccg aat aaa ccg ctg gaa gtg gtg cgt acc att cac tec ttt gac 1632 

Asp Pro Asn Lys Pro Leu Glu Val* Val Arg Thr He His Ser Phe Asp 
530 535 540 



1488 



1536 



1584 



1680 



1704 
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<210> 204 
<211> 1179 
<212> DNA 

<213> Escherichia coli 

<220> 

<221> CDS 

<222> (1) . . . (1179) 

<400> 204 

atg agt cat gat cca caa ccg ctg ggc ggc aaa ate ate agt' aaa ccg 48 
Met Ser His Asp Pro Gin Pro Leu Gly Gly Lys lie lie Ser Lys Pro 
15 10 15 

gtc atg att ttt gga ccg tta ate gtc ate tgt atg etc ctg att gtg 96 
Val Met lie Phe Gly Pro Leu lie Val He Cys Met Leu Leu He Val 
20 ' 25 30 

aag cgt ctg gtg ttc ggt ctg ggc tct gtc tct gac ctg aac ggc ggc 144 
Lys Arg Leu Val Phe Gly Leu Gly Ser Val Ser Asp Leu Asn Gly Gly 
35 40 45 

ttc ccg tgg ggc gtg tgg ate gcg ttt gac ctg ctg att ggc acc ggc 192 
Phe Pro Trp Gly Val Trp He Ala Phe Asp Leu Leu He Gly Thr Gly 
50 "* 55 60 

ttt gee tgt ggc ggc tgg gcg ctg gcg tgg gcg gta tac gtc tit aac 240 
Phe Ala Cys Gly Gly Trp Ala Leu Ala Trp Ala Val Tyr Val Phe Asn 
65 70 75 80 

cgt ggg caa tac cat ccg ctg gtg cgt ccg gcg ctg ttg gcg agt ctg 288 
Arg Gly Gin Tyr His Pro Leu Val Arg Pro Ala Leu Leu Ala Ser Leu 
85 90 95 

ttt ggt tac tea ctg ggt ggc ttg teg ate act ate gac gtg ggt cgc 336 
Phe Gly Tyr Ser Leu Gly Gly Leu Ser He Thr He Asp Val Gly Arg 
100 105 110 

tac tgg aac ctg ccg tac ttc tac att ccg ggt cac ttc aac gtg aac 384 
Tyr Trp Asn Leu Pro Tyr Phe Tyr He Pro Gly His Phe Asn Val Asn 
115 120 125 

teg gta ctg ttc gag acg gcg gtc tgt atg acc ate tat ate ggc gtg 4 32 

Ser Val Leu Phe Glu Thr Ala Val Cys Met Thr He Tyr He Gly Val 
130 135 140 

atg gca ctg gag ttt get ccg gca ctg ttt gaa cgt ctg ggg tgg aag 480 
Met Ala Leu Glu Phe Ala Pro Ala Leu Phe Glu Arg Leu Gly Trp Lys 
145 150 155 160 

gtg teg eta cag cga eta aac aag gtg atg ttc ttc ate ate gcg etc 528 
Val Ser Leu Gin Arg Leu Asn Lys Val Met Phe Phe He He Ala Leu 
165 170 175 

ggt gcg ctg ctg ccg acc atg cac cag tct tea atg ggg teg ctg atg 57 6 

Gly Ala Leu Leu Pro Thr Met His Gin Ser Ser Met Gly Ser Leu Met 
180 185 190 

ate teg gcg ggc tac aag gtg cat ccg ttg tgg cag age tat gaa atg 624 
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He Ser Ala Gly Tyr Lys Val His Pro Leu Trp Gin Ser Tyr Glu Met 
195 200 205 

ttg ccg ctg ttc teg ctg ctg acg gcg ttc ate atg ggc ttc teg att 672 
Leu Pro Leu Phe Ser Leu Leu Thr Ala Phe He Met Gly Phe Ser He 
210 215 220 



gtc ate ttt gaa ggt teg ctg gtg cag gcg ggt ctg cgt ggc aac ggt 
Val He Phe Glu Gly Ser Leu Val Gin Ala Gly Leu Arg Gly Asn Gly 
225 230 235 240 



720 



ccg gat gaa aag agt ctg ttt gtt aag ctg acc aac acc ate agt gtg 7 68 

Pro Asp Glu Lys Ser Leu Phe Val Lys Leu Thr Asn Thr He Ser Val 
245 250 255 

ttg ctg gcg att ttc ate gtg ctg cgc ttt ggc gag ctt ate tat cgc 816 
Leu Leu Ala He Phe He Val Leu Arg Phe Gly Glu Leu He Tyr Arg 
260 265 270 

gac aag ctg teg tta gcg ttt gee ggt gac ttc tac tec gtg atg ttc 864 
Asp Lys Leu Ser Leu Ala Phe Ala Gly Asp Phe Tyr Ser Val Met Phe 
275 280 " 285 

tgg att gaa gtc ctg ctg atg etc ttc ccg ctg gtc gtt ctg cgt gtg 912 
Trp He Glu Val Leu Leu Met Leu Phe Pro Leu Val Val Leu Arg Val 
290 295 300 

gcg aag ctg cgt aat gat tec cgc atg ctg ttc ctg tea gca ctg age 960 
Ala Lys Leu Arg Asn Asp Ser Arg Met Leu Phe Leu Ser Ala Leu Ser 
305 310 315 320 

gca ctg tta ggt tgt gca acc tgg cgt ctg acc tat teg ctg gtg gca 1008 
Ala Leu Leu Gly Cys Ala Thr Trp Arg Leu Thr Tyr Ser Leu Val Ala 
325 330 335 

ttc aac ccg ggc ggc ggt tac gec tac ttc ccg acc tgg gaa gaa ctg 1056 
Phe Asn Pro Gly Gly Gly Tyr Ala Tyr Phe Pro Thr Trp Glu Glu Leu 
340 345 350 

ttg att tct att ggt ttt gtg get att gag att tgc get tac ate gta 1104 
Leu He Ser He Gly Phe Val Ala He Glu He Cys Ala Tyr He Val 
355 360 365 

etc att cgt eta ctg ccg ata ctt cct cct tta aaa caa aac gat cat 1152 
Leu He Arg Leu Leu Pro He Leu Pro Pro Leu Lys Gin Asn Asp His 
370 375 380 

aat cgt cat gag gcg age aaa gca tga 1179 
Asn Arg His Glu Ala Ser Lys Ala * 
385 390 



<210> 205 
<211> 987 
<212> DNA 

<213> Escherichia coli 

<220> 

<221> CDS 

<222> (1) . . . (987) 
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<400> 205 

gtg aac aga cgt aat ttt att aaa gca gcc tec tgc ggg gca ttg ctg 48 

Met Asn Arg Arg Asn Phe He Lys Ala Ala Ser Cys Gly Ala Leu Leu 
15 10 15 

acg ggc gcg ctg ccg tct gtc agt cat gcg get get gaa aac cgc ccg 96 

Thr Gly Ala Leu Pro Ser Val Ser His Ala Ala Ala Glu Asn Arg Pro 
20 25 30 

cca att ccg gga teg ctg ggg atg ttg tac gac teg acc ttg tgc gta 144 

Pro He Pro Gly Ser Leu Gly Met Leu Tyr Asp Ser Thr Leu Cys Val 
35 40 45 



ggc tgc cag get tgc gtc acc aag tgt cag gat ate aat ttc cct gaa 
Gly Cys Gin Ala Cys Val Thr Lys Cys Gin Asp He Asn Phe Pro Glu 
50 55 60 



ccg tat acc aat aac ate att cag gtg tgg acc age ggc aca ggg gtc 
Pro Tyr Thr Asn Asn He He Gin Val Trp Thr Ser Gly Thr Gly Val 
85 90 95 



aag tac gac tac aac aac ccg ttt ggt gcg ctg cat aag tgc gag ctg 
Lys Tyr Asp Tyr Asn Asn Pro Phe Gly Ala Leu His Lys Cys Glu Leu 
165 170 175 



192 



cgt aac ccg caa ggg gaa cag acc tgg teg aac aac gac aaa ctg teg 24 0 

Arg Asn Pro Gin Gly Glu Gin Thr Trp Ser Asn Asn Asp Lys Leu Ser 
65 70 75 80 



288 



aac aaa gac cag gag gag aac ggc tac gcg tac att aag aaa cag tgt 336 

Asn Lys Asp Gin Glu Glu Asn Gly Tyr Ala Tyr He Lys Lys Gin Cys 
100 105 110 

atg cac tgc gtc gat ccg aac tgt gtc tct gtg tgc ccg gtc tct gca 384 

Met His Cys Val Asp Pro Asn Cys Val Ser Val Cys Pro Val Ser Ala 
115 120 125 

ctg aaa aaa gat ccg aaa acc ggc att gtc cat tac gac aaa gat gtg 4 32 

Leu Lys Lys Asp Pro Lys Thr Gly lie Val His Tyr Asp Lys Asp Val 

130 * 135 140 

tgc acc ggc tgc cgt tac tgc atg gtc gcc tgt ccg tac aac gtg ccg 4 80 

Cys Thr Gly Cys Arg Tyr Cys Met Val Ala Cys Pro Tyr Asn Val Pro 

145 150 155 160 



528 



tgc aac cag aaa ggt gtg gaa cgt etc gat aaa ggc ggt eta cct ggc 576 
Cys Asn Gin Lys Gly Val Glu Arg Leu Asp Lys Gly Gly Leu Pro Gly 
180 185 190 

tgc gta gaa gtg tgc ccg gcg ggc gcg gtg att ttc ggt acg cgt gaa 624 
Cys Val Glu Val Cys Pro Ala Gly Ala Val He Phe Gly Thr Arg Glu 
195 200 205 

gag ctg atg gcg gag gcg aaa aaa cgt ctg gcg ctg aag cct ggc age 672 
Glu Leu Met Ala Glu Ala Lys Lys Arg Leu Ala Leu Lys Pro Gly Ser 
210 215 220 

gaa tac cac tat ccg cgt cag acg ctg aaa tct ggc gac act tac ctg 720 
Glu Tyr His Tyr Pro Arg Gin Thr Leu Lys Ser Gly Asp Thr Tyr Leu 
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225 230 235 240 

cat acg gtg ccg aaa tat tat ccg cat ctg tac ggc gag aaa gag ggc 768 
His Thr Val Pro Lys Tyr Tyr Pro His Leu Tyr Gly Glu Lys Glu Giy 
245 250 255 

ggc ggt act cag gtt ctg gta ctg acg ggt gtg cct tat gaa aat etc 816 
Gly Gly Thr Gin Val Leu Val Leu Thr Gly Val Pro Tyr Glu Asn Leu 
260 265 270 

gac ctg ccg aaa ctg gac gat ctt tct acc ggt gcg cgt tec gaa aat 8 64 

Asp Leu Pro Lys Leu Asp Asp Leu Ser Thr Gly Ala Arg Ser Glu Asn 
275 ' 280 285 

att caa cac acc ctg tat aaa ggc atg atg eta cca ctg get gtg ctg 912 
He Gin His Thr Leu Tyr Lys Gly Met Met Leu Pro Leu Ala Val Leu 
290 295 300 

gcg ggc tta acc gtg ctg gtt cgt cgc aac acc aaa aac gac cat cac 960 
Ala Gly Leu Thr Val Leu Val Arg Arg Asn Thr Lys Asn Asp His His 
305 310 315 320 

gac gga gga gac gat cat gag tea tga 987 
Asp Gly Gly Asp Asp His Glu Ser * 
325 



<210> 206 
<211> 2475 
<212> DNA 

<213> Escherichia coli 

<220> 

<221> CDS 

<222> (1) . . . (2475) 

<400> 206 

atg eta caa tgt ggc gcg aag aat gtt aac cct ctg gag cgt ttt gtg 48 
Met Leu Gin Cys Gly Ala Lys Asn Val Asn Pro Leu Glu Arg Phe Val 
1 5 10 15 

teg teg ttg ccc gtt get gee gtc tta cct gaa tta ctt acc get etc 96 
Ser Ser Leu Pro Val Ala Ala Val Leu Pro Glu Leu Leu Thr Ala Leu 
20 25 30 

gat tgt gcg ccg cag gta tta tta agt gcg ccg acc ggg gec ggg aaa 144 
Asp Cys Ala Pro Gin Val Leu Leu Ser Ala Pro Thr Gly Ala Gly Lys 
35 40 . 45 

tea acc tgg ctg ccg ctg caa ctg ctg gcg cat ccc ggc att aac ggg 192 
Ser Thr Trp Leu Pro Leu Gin Leu Leu Ala His Pro Gly lie Asn Gly 
50 55 60 

aaa att ate ctg ctg gag ccg cgt cgt ctg gcg gcg cgt aac gtc gcg 240 
Lys He He Leu Leu Glu Pro Arg Arg Leu Ala Ala Arg Asn Val Ala 
65 70 . 75 80 

caa egg ctg gcg gag ctg ctt aac gaa aag cca ggc gat acc gtt ggc 288 
Gin Arg Leu Ala Glu Leu Leu Asn Glu Lys Pro Gly Asp Thr Val Gly 
85 90 95 
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tac egg atg cgt gcg caa aac tgc gtc ggg ccg aat acc cgc ctg gaa 336 
Tyr Arg Met Arg Ala Gin Asn Cys Val Gly Pro Asn Thr Arg Leu Glu 
100 105 110 

gtg gtt acc gaa ggc gtg ctg acg cgc atg ate cag cgt gac ccg gaa 384 
Val Val Thr Glu Gly Val Leu Thr Arg Met lie Gin Arg Asp Pro Glu 
115 " 120 125 

ctg age ggt gtt gga ctg gtg ate ctt gat gaa ttt cat gag cgc age 432 
Leu Ser Gly Val Gly Leu Val lie Leu Asp Glu Phe His Glu Arg Ser 
130 135 140 

ttg cag gcg gat ttg gcg ttg gcg ctg tta etc gat gtg caa caa ggt 480 
Leu Gin Ala Asp Leu Ala Leu Ala Leu Leu Leu Asp Val Gin Gin Gly 
145 * 150 155 160 

ctg cgt gat gac ctt aaa ctg ctg att atg teg get acc ctg gac aac 528 
Leu Arg Asp Asp Leu Lys Leu Leu lie Met Ser Ala Thr Leu Asp Asn 
165 170 175 

gac cgc ttg cag caa atg ctg cca gaa gcg cct gtc gtc ate tea gaa 576 
Asp Arg Leu Gin Gin Met Leu Pro Glu Ala Pro Val Val lie Ser Glu 
180 185 190 

ggg cgc teg ttt ccg gtt gaa cgc cgt tat tta ccg ctg ccc gcg cat 624 
Gly Arg Ser Phe Pro Val Glu Arg Arg Tyr Leu Pro Leu Pro Ala His 
195 200 205 

cag cgt ttt gac gat gee gtt gcg gta gee acc get gaa atg ctg cgt 672 
Gin Arg Phe Asp Asp Ala Val Ala Val Ala Thr Ala Glu Met Leu Arg 
210 215 220 

cag gaa age gga tea tta ctg tta ttt tta cct ggc gtc gga gaa att 720 
Gin Glu Ser Gly Ser Leu Leu Leu Phe Leu Pro Gly Val Gly Glu lie 
225 ' 230 235 240 

cag cgt gtg cag gaa caa ctg get teg cgc ate ggc agt gat gta ttg 768 
Gin Arg Val Gin Glu Gin Leu Ala Ser Arg lie Gly Ser Asp Val Leu 
245 250 255 

etc tgc ccg ctg tat ggc gcg ttg teg ctg aac gat cag cga aaa gcg 816 
Leu Cys Pro Leu Tyr Gly Ala Leu Ser Leu Asn Asp Gin Arg Lys Ala 
260 265 270 

ate etc ccg gca ccg caa ggg atg cgc aaa gtg gtg ctg gcg acc aat 864 
lie Leu Pro Ala Pro Gin Gly Met Arg Lys Val Val Leu Ala Thr Asn 
275 280 285 

att get gaa acc agt tta acc att gaa ggt att cgt ctg gtg gtg gat 912 
lie Ala Glu Thr Ser Leu Thr He Glu Gly He Arg Leu Val Val Asp 
290 295 300 



tgt gee cag gag cgt gtg gcg cgt ttt gat ccg cgc acg ggg ctt acg 
Cys Ala Gin Glu Arg Val Ala Arg Phe Asp Pro Arg Thr Gly Leu Thr 
305 310 315 320 



960 



cga ctg att act caa cgc gtt age cag gca tec atg acg cag cgt gee 1008 
Arg Leu He Thr Gin Arg Val Ser Gin Ala Ser Met Thr Gin Arg Ala 
325 330 335 
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ggg cgc gcc ggg cgt ctg gag ccg ggt ate age ctg cat tta ate gee 
Gly Arg Ala Gly Arg Leu Glu Pro Gly lie Ser Leu His Leu lie Ala 
340 345 350 



1056 



aaa gaa caa gca gaa cgc gcc gcg gcg caa agt gaa ccg gag ate tta 1104 
Lys Glu Gin Ala Glu Arg Ala Ala Ala Gin Ser Glu Pro Glu He Leu 
355 360 365 

caa age gat ctt tec ggt ttg ctg atg gaa tta ctg caa tgg gga tgc 1152 
Gin Ser Asp Leu Ser Gly Leu Leu Met Glu Leu Leu Gin Trp Gly Cys 
370 375 380 



age gat ccg gcg cag atg age tgg ctg gat caa ccg cca gta gtg aat 
Ser Asp Pro Ala Gin Met Ser Trp Leu Asp Gin Pro Pro Val Val Asn 
385 390 395 400 



gaa egg ctt agt gcg caa ggg caa aaa atg gca gcg ctg ggt aac gat 
Glu Arg Leu Ser Ala Gin Gly Gin Lys Met Ala Ala Leu Gly Asn Asp 
420 425 430 



ggc aat agt gac ctg ggc gtg gcg ttt teg cgc aat caa cca gcc tgg 
Gly Asn Ser Asp Leu Gly Val Ala Phe Ser Arg Asn Gin Pro Ala Trp 
465 * 470 475 480 

cag caa cgt agt cag caa ctg tta aaa cgc tta aac gta cgt ggc ggt 
Gin Gin Arg Ser Gin Gin Leu Leu Lys Arg Leu Asn Val Arg Gly Gly 
485 490 495 



gat cgc att get cgt cgc cgt ggg caa gat gga cgc tat caa ctg gca 
Asp Arg He Ala Arg Arg Arg Gly Gin Asp Gly Arg Tyr Gin Leu Ala 
515 520 525 

aac ggc atg gga gcg atg etc gat gcc aac gac gcg eta age cgc cac 
Asn Gly Met Gly Ala Met Leu Asp Ala Asn Asp Ala Leu Ser Arg His 
530 535 540 

gaa tgg ttg ate gca ccg tta tta ttg cag ggc age gcc teg ccg gat 
Glu Trp Leu He Ala Pro Leu Leu Leu Gin Gly Ser Ala Ser Pro Asp 
545 550 555 560 



1200 



eta ctg gcc gcg aaa cgt ctg tta caa atg ctg ggg gca ctg gag ggt 1248 
Leu Leu Ala Ala Lys Arg Leu Leu Gin Met Leu Gly Ala Leu Glu Gly 
405 410 415 



1296 



ccg cgt tta gcg gca atg ctg gtt age gcg aag aac gac gac gaa get 1344 
Pro Arg Leu Ala Ala Met Leu Val Ser Ala Lys Asn Asp Asp Glu Ala 
435 440 445 

get ace gcg gca aaa att gcc gcc att etc gaa gag ccg cca egg atg 1392 
Ala Thr Ala Ala Lys He Ala Ala He Leu Glu Glu Pro Pro Arg Met 
450 455 460 



1440 



1488 



gag gca gac agt teg ctt ate gcg ccg eta ctt gcc ggg gcg ttt gcc 1536 
Glu Ala Asp Ser Ser Leu He Ala Pro Leu Leu Ala Gly Ala Phe Ala 
500 505 510 



1584 



1632 



1680 



gcg egg att tta ctg gcg ctg ctg gtc gat att gat gag tta gta caa 1728 
Ala Arg He Leu Leu Ala Leu Leu Val Asp He Asp Glu Leu Val Gin 
565 570 575 
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cgc tgc ccg cag ctg gta cag cag tct gac act gtg gag tgg gat gac 1776 
Arg Cys Pro Gin Leu Val Gin Gin Ser Asp Thr Val Glu Trp Asp Asp 
580 585 590 

gcg caa ggt acg ctg aaa gcc tgg cgt egg eta caa ate ggt cag ttg 1824 
Ala Gin Gly Thr Leu Lys Ala Trp Arg Arg Leu Gin lie Gly Gin Leu 
595 600 605 

acg gtg aaa gtg cag ccg ctg gcg aaa ccg tea gaa gac gag ttg cat 1872 
Thr Val Lys Val Gin Pro Leu Ala Lys Pro Ser Glu Asp Glu Leu His 
610 615 ~ 620 

cag gcg atg ctt aat ggc ate cgt gat aaa ggt tta age gtg etc aac 1920 
Gin Ala Met Leu Asn Gly He Arg Asp Lys Gly Leu Ser Val Leu Asn 
625 630 635 640 

tgg acg gcg gaa gcg gaa cag eta cgc ttg cgt ttg tta tgc gcc gca 1968 
Trp Thr Ala Glu Ala Glu Gin Leu Arg Leu Arg Leu Leu Cys Ala Ala 
645 650 655 

aag tgg ttg ccg gaa tat gac tgg cca gcg gtt gat gat gaa agt tta 2016 
Lys Trp Leu Pro Glu Tyr Asp Trp Pro Ala Val Asp Asp Glu Ser Leu 
660 665 670 

ttg gca gcg ctg gaa acg tgg ctg ctg cca cat atg act ggc gta cat 2064 
Leu Ala Ala Leu Glu Thr Trp Leu Leu Pro His Met Thr Gly Val His 
675 680 685 

tea eta cgc ggc ctg aaa tea etc gac att tat cag gca eta cgc gga 2112 
Ser Leu Arg Gly Leu Lys Ser Leu Asp He Tyr Gin Ala Leu Arg Gly 
690 695 700 

tta ctt gat tgg gga atg cag caa cgt ctg gat agt gaa ttg cct gcg 2160 
Leu Leu Asp Trp Gly Met Gin Gin Arg Leu Asp Ser Glu Leu Pro Ala 
705 " 710 715 720 

cat tac act gtg ccg acg gga age egg ate gcc att cgt tat cat gaa 2208 
His Tyr Thr Val Pro Thr Gly Ser Arg He Ala He Arg Tyr His Glu 
725 730 735 

gat aac ccg ccc gcg ctg gcg gtg aga atg caa gag atg ttt ggc gag 2256 
Asp Asn Pro Pro Ala Leu Ala Val Arg Met Gin Glu Met Phe Gly Glu 
740 745 750 

gcc ace aat ccg acg ate gcc cag ggg cgc gtg ccg ctg gtg ctg gag 2304 
Ala Thr Asn Pro Thr He Ala Gin Gly Arg Val Pro Leu Val Leu Glu 
755 760 765 

ttg ctt tea cct gcc caa agg cca tta caa ate aca cga gat ttg age 2 352 

Leu Leu Ser Pro Ala Gin Arg Pro Leu Gin He Thr Arg Asp Leu Ser 
770 775 780 

gac ttc tgg aaa gga gcg tac cgt gag gtg caa aaa gag atg aaa ggg 2400 
Asp Phe Trp Lys Gly Ala Tyr Arg Glu Val Gin Lys Glu Met Lys Gly 
785 790 795 800 

cgt tat ccc aaa cat gtc tgg ccg gac gac ccg gca aat act gca ccg 24 48 
Arg Tyr Pro Lys His Val Trp Pro Asp Asp Pro Ala Asn Thr Ala Pro 
805 810 815 
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acg cga egg acg aaa aag tat teg taa 2475 
Thr Arg Arg Thr Lys Lys Tyr Ser * 
820 



<210> 207 
<211> 951 
<212> DNA 

<213> Escherichia coli 

<220> 

<221> CDS 

<222> (1) . . . (951) 



<400> 207 

atg agt aaa ctt gat act ttt ate caa cat get gta aac get gtt ccg 

Met Ser Lys Leu Asp Thr Phe He Gin His Ala Val Asn Ala Val Pro 
15 10 15 

gtc agt ggc aca tct ttg ate tec tct ctg tat ggt gat teg ctt tec 

Val Ser Gly Thr Ser Leu He Ser Ser Leu Tyr Gly Asp Ser Leu Ser 
20 25 30 



aat aaa gaa ggc tgg ctg gat gtt tec cgc ate ggg cga cgc agt ttc 
Asn Lys Glu Gly Trp Leu Asp Val Ser Arg He Gly Arg Arg Ser Phe 
65 * 70 75 80 

tat age etc agt gat aaa ggc ttg cgc ctg acg cga egg gca gaa agt 
Tyr Ser Leu Ser Asp Lys Gly Leu Arg Leu Thr Arg Arg Ala Glu Ser 
85 90 95 

aaa att tat cgc gca gag caa cct gca tgg gat ggt aaa tgg etc ctg 
Lys He Tyr Arg Ala Glu Gin Pro Ala Trp Asp Gly Lys Trp Leu Leu 
100 105 HO 

ttg etc teg gaa ggt tta gat aaa tea acg ctg get gat gtc aaa aag 
Leu Leu Ser Glu Gly Leu Asp Lys Ser Thr Leu Ala Asp Val Lys Lys 
115 120 125 

cag ttg ate tgg caa ggt ttt ggc gca ctg gca ccc age ctg atg gca 
Gin Leu He Trp Gin Gly Phe Gly Ala Leu Ala Pro Ser Leu Met Ala 
130 ~ 135 140 

teg ccg teg caa aaa ctg gee gat gta cag aca ctt ttg cat gaa gcg 
Ser Pro Ser Gin Lys Leu Ala Asp Val Gin Thr Leu Leu His Glu Ala 
145 150 155 160 

ggt gtg gcg gat aac gtg att tgt ttt gaa gcg caa ata cca ctg gcg 
Gly Val Ala Asp Asn Val He Cys Phe Glu Ala Gin He Pro Leu Ala 
165 170 175 



48 



96 



cat cgt ggt ggt gaa ate tgg ttg ggt agt ctg get get ttg ctg gaa 144 
His Arg Gly Gly Glu lie Trp Leu Gly Ser Leu Ala Ala Leu Leu Glu 
35 40 45 

ggg ctg gga ttt ggt gag cgt ttc gtg cgc ace get ttg ttt cgt ctt 192 
Gly Leu Gly Phe Gly Glu Arg Phe Val Arg Thr Ala Leu Phe Arg Leu 
50 "* 55 60 



240 



288 



336 



384 



432 



480 



528 
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ctt tct cgc gca gca ctg cgt gcc aga gta gaa gag tgc tgg cat tta 57 6 

Leu Ser Arg Ala Ala Leu Arg Ala Arg Val Glu Glu Cys Trp His Leu 
180 185 190 

act gaa caa aat gcc atg tac gaa acc ttt att cag tea ttc cgc ccg 624 
Thr Glu Gin Asn Ala Met Tyr Glu Thr Phe lie Gin Ser Phe Arg Pro 
195 200 205 

ctg gtg ccg ctt tta aaa gag gcg gca gac gag tta acc ccg gag egg 672 
Leu Val Pro Leu Leu Lys Glu Ala Ala Asp Glu Leu Thr Pro Glu Arg 
210 215 220 

gca ttt cat att cag ctt tta ctg ate cat ttt tat cgc cgt gtc gtc 720 
Ala Phe His lie Gin Leu Leu Leu He His Phe Tyr Arg Arg Val Val 
225 230 235 240 



ctt aaa gac cca ttg ttg ccg gag gag ttg ctt ccg gca cac tgg gca 
Leu Lys Asp Pro Leu Leu Pro Glu Glu Leu Leu Pro Ala His Trp Ala 
245 250 255 



768 



ggg cat acg gcg cgt cag ctg tgt ate aac att tat cag cgc gta gcg 816 

Gly His Thr Ala Arg Gin Leu Cys He Asn He Tyr Gin Arg Val Ala 
260 265 270 

cct get get tta gcg ttc gtt agt gaa aaa ggt gaa acc teg gtc ggt 8 64 

Pro Ala Ala Leu Ala Phe Val Ser Glu Lys Gly Glu Thr Ser Val Gly 
275 280 285 

gaa ctg cct gcg ccg gga age ctg tat ttt caa cgt ttt ggc ggc ttg 912 

Glu Leu Pro Ala Pro Gly Ser Leu Tyr Phe Gin Arg Phe Gly Gly Leu 

290 295 300 

aat att gaa cag gag gcg tta tgc caa ttt ate aga tag 951 

Asn He Glu Gin Glu Ala Leu Cys Gin Phe He Arg * 
305 310 315 



<210> 208 
<211> 591 
<212> DNA 

<213> Escherichia coli 

<220> 

<221> CDS 

<222> (1) . (591) 

<400> 208 

atg cca att tat cag ata gac ggt ctg act ccg gtt gtg cca gaa gag 48 

Met Pro He Tyr Gin He Asp Gly Leu Thr Pro Val Val Pro Glu Glu 

15 10 15 

agt ttt gtc cat ccg aca gcg gta ttg ate ggc gat gtt att etc ggc 96 
Ser Phe Val His Pro Thr Ala Val Leu He Gly Asp Val He Leu Gly 
20 25 30 

aag ggc gtt tac gtt ggg cca aat gcc age ctg cgt ggc gat ttt ggt 144 
Lys Gly Val Tyr Val Gly Pro Asn Ala Ser Leu Arg Gly Asp Phe Gly 
35 40 45 

cgt ate gtg gtg aaa gat ggc gcg aac att cag gat aat tgc gtt atg 192 
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Arg lie Val Val Lys Asp Gly Ala Asn lie Gin Asp Asn Cys Val Met 
50 55 60 

cac ggt ttt ccc gag cag gat act gtt gta gga gaa gat gga cat att 
His Gly Phe Pro Glu Gin Asp Thr Val Val Gly Glu Asp Gly His lie 
65 70 75 80 

ggt cat age get ate ctt cac ggc tgc att ate cgc cgc aat gca tta 
Gly His Ser Ala He Leu His Gly Cys He He Arg Arg Asn Ala Leu 
85 90 95 



cga ccg aaa cag taa 
Arg Pro Lys Gin * 
195 



<210> 209 
<211> 1281 
<212> DNA 

<213> Escherichia coli 



240 



288 



gtg gga atg aac gcg gta gtg atg gac ggt gcg gtg att ggc gag aac 336 
Val Gly Met Asn Ala Val Val Met Asp Gly Ala Val He Gly Glu Asn 
100 105 110 

age att gtt ggt gca tec gca ttt gtg aaa gee aaa gca gaa atg cca 384 
Ser He Val Gly Ala Ser Ala Phe Val Lys Ala Lys Ala Glu Met Pro 
115 120 125 

get aat tac ctg att gtc ggc age ccg gcg aaa gcg att cgt gaa etc 
Ala Asn Tyr Leu He Val Gly Ser Pro Ala Lys Ala He Arg Glu Leu 
130 135 140 

agt gag cag gag ttg gca tgg aaa aag cag ggt acg cat gag tac cag 
Ser Glu Gin Glu Leu Ala Trp Lys Lys Gin Gly Thr His Glu Tyr Gin 
145 150 155 160 

gtg ctg gtg aca cgc tgt aag cag acg tta cat caa gtc gag cca ttg 
Val Leu Val Thr Arg Cys Lys Gin Thr Leu His Gin Val Glu Pro Leu 
165 170 175 

egg gaa att gaa cct ggc agg aaa cgc ctg gta ttt gat gag aat ctg 576 
Arg Glu He Glu Pro Gly Arg Lys Arg Leu Val Phe Asp Glu Asn Leu 
180 185 190 



432 



480 



528 



591 



<220> 

<221> CDS 

<222> (1) . . . (1281) 

<400> 209 

gtg aaa cat ctg aca gaa atg gtg aga cag cac aaa gcg ggc aaa aca 
Met Lys His Leu Thr Glu Met Val Arg Gin His Lys Ala Gly Lys Thr 
15 10 15 



48 



aat gga att tat gec gtt tgt tec gca cat ccg ctg gtg ctg gaa get 96 
Asn Gly He Tyr Ala Val Cys Ser Ala His Pro Leu Val Leu Glu Ala 
20 25 30 

gca ate cgc tac gee agt gca aac caa acg ccg tta ctg att gaa gca 144 
Ala He Arg Tyr Ala Ser Ala Asn Gin Thr Pro Leu Leu He Glu Ala 
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35 40 45 

acc tec aat cag gta gac cag ttc ggc ggt tat acc gga atg acg ccc 192 
Thr Ser Asn Gin Val Asp Gin Phe Gly Gly Tyr Thr Gly Met Thr Pro 
50 55 60 

gec gat ttt cgc ggc ttt gtt tgt cag etc gec gac teg ttg aat ttc 240 
Ala Asp Phe Arg Gly Phe Val Cys Gin Leu Ala Asp Ser Leu Asn Phe 
65 70 75 80 

ccg cag gat gcg ttg att ctg ggt ggt gac cat ctg ggg cca aac cgc 288 
Pro Gin Asp Ala Leu lie Leu Gly Gly Asp His Leu Gly Pro Asn Arg 
85 90 95 

tgg caa aac ctg ccg gee get cag gca atg gec aat gee gat gat ttg 336 
Trp Gin Asn Leu Pro Ala Ala Gin Ala Met Ala Asn Ala Asp Asp Leu 
100 105 110 

att aaa age tac gtt gcg gca gga ttc aaa aaa ate cac ctt gat tgc 384 
lie Lys Ser Tyr Val Ala Ala Gly Phe Lys Lys lie His Leu Asp Cys 
115 " 120 125 

age atg tec tgt cag gac gat ccg att ccc tta act gat gac ate gtg 432 
Ser Met Ser Cys Gin Asp Asp Pro lie Pro Leu Thr Asp Asp lie Val 
130 135 140 

get gaa cgc gee gee cgt ctg gcg aaa gtg gcg gaa gaa acc tgt ctt 480 
Ala Glu Arg Ala Ala Arg Leu Ala Lys Val Ala Glu Glu Thr Cys Leu 
145 150 155 160 

gaa cac ttt ggc gaa gee gat ctg gag tat gtc att ggt acc gaa gtg 528 
Glu His Phe Gly Glu Ala Asp Leu Glu Tyr Val He Gly Thr Glu Val 
165 170 175 

ccg gta cct ggc ggc gcg cat gaa acc tta age gag ctg gcg gtc acc 576 
Pro Val Pro Gly Gly Ala His Glu Thr Leu Ser Glu Leu Ala Val Thr 
180 185 190 

acg ccg gat gee gec cgc gee acg ctg gaa gec cat cgt cac gec ttt 624 
Thr Pro Asp Ala Ala Arg Ala Thr Leu Glu Ala His Arg His Ala Phe 
195 200 205 

gaa aag caa ggt ttg aat gee ate tgg cca cgc ate att gec ctg gtg 672 
Glu Lys Gin Gly Leu Asn Ala He Trp Pro Arg He lie Ala Leu Val 
210 * 215 220 

gtt caa ccc ggc gtc gaa ttc gat cac acc aac gtt att gat tat cag 720 
Val Gin Pro Gly Val Glu Phe Asp His Thr Asn Val He Asp Tyr Gin 
225 230 235 240 

ccc gee aaa gcg age gec tta age cag atg gtc gaa aac tac gaa acg 768 
Pro Ala Lys Ala Ser Ala Leu Ser Gin Met Val Glu Asn Tyr Glu Thr 
245 250 255 

ctg att ttc gaa gcg cac tct acc gat tat caa acg ccg caa teg ctg 816 
Leu He Phe Glu Ala His Ser Thr Asp Tyr Gin Thr Pro Gin Ser Leu 
260 265 270 

cgc cag ctg gtg att gac cac ttt gee att ctg aaa gtt ggc cca gcg 864 
Arg Gin Leu Val He Asp His Phe Ala He Leu Lys Val Gly Pro Ala 
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275 280 285 

ctg acc ttc gcc ctg cgt gaa get ctg ttc tct ctg gcg gcg att gaa 912 
Leu Thr Phe Ala Leu Arg Glu Ala Leu Phe Ser Leu Ala Ala lie Glu 
290 295 300 

gaa gaa ctg gtg cca gcg aaa gcc tgt tct ggt ctg cgt cag gtg ctg 
Glu Glu Leu Val Pro Ala Lys Ala Cys Ser Gly Leu Arg Gin Val Leu 
305 310 315 320 

gaa gac gtg atg etc gac cgc ccg gaa tac tgg caa age cac tac cac 
Glu Asp Val Met Leu Asp Arg Pro Glu Tyr Trp Gin Ser His Tyr. His 
325 330 335 

ggt gac ggc aac gcg cgt cgt ctg gcg cgt ggt tat age tac teg gat 
Gly Asp Gly Asn Ala Arg Arg Leu Ala Arg Gly Tyr Ser Tyr Ser Asp 
340 345 350 

cgc gtg cgc tat tac tgg ccg gac age cag att gat gac get ttc get 
Arg Val Arg Tyr Tyr Trp Pro Asp Ser Gin lie Asp Asp Ala Phe Ala 
355 ~ 360 365 



cag cca acg cca egg gaa etc att ate aac cat att cag gac ate ctg 
Gin Pro Thr Pro Arg Glu Leu lie lie Asn His lie Gin Asp lie Leu 
405 410 415 

gcg cag tac cac aca gcc tgt gaa ggc caa taa 
Ala Gin Tyr His Thr Ala Cys Glu Gly Gin * 
420 425 



<210> 210 
<211> 510 
<212> DNA 

<213> Escherichia coli 

<220> 

<221> CDS 

<222> (1) . (510) 

<400> 210 

gtg aag gcc aat aag caa aac aaa gag gaa cac get atg cca aat att 

Met Lys Ala Asn Lys Gin Asn Lys Glu Glu His Ala Met Pro Asn lie 
1 5 10 15 

gtt tta age egg att gat gaa cgc ttg att cac ggt cag gtc ggc gtt 
Val Leu Ser Arg He Asp Glu Arg Leu He His Gly Gin Val Gly Val 
20 25 30 



960 



1008 



1056 



1104 



cat ctg gta cgt aat ctg gcg gat tea cca att ccg ctg ccg ctg ate .1152 
His Leu Val Arg Asn Leu Ala Asp Ser Pro He Pro Leu Pro Leu He 
370 375 380 

age cag tat ctg ccg ctg cag tac gtg aaa gtt cgc tec ggc gag ctg 1200 
Ser Gin Tyr Leu Pro Leu Gin Tyr, Val Lys Val Arg Ser Gly Glu Leu 
385 " 390 395 400 



1248 



1281 



48 



96 



caa tgg gtc gga ttt gcg ggg gca aat ctg gtg ctg gta gcc aac gat 144 
Gin Trp Val Gly Phe Ala Gly Ala Asn Leu Val Leu Val Ala Asn Asp 
35 " 40 45 
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gag gtt gcc gaa gat ccg gta caa caa aac ctg atg gaa atg gta ctg 192 

Glu Val Ala Glu Asp Pro Val Gin Gin Asn Leu Met Glu Met Val Leu 

50 55 60 

gca gaa ggg ate gcc gta cgt ttc tgg acg ctg caa aaa gtt ate gac 240 

Ala Glu Gly lie Ala Val Arg Phe Trp Thr Leu Gin Lys Val lie Asp 

65 70 75 80 

aac att cat cgc gcc gcc gat cga cag aaa ate ctg ctg gtt tgt aaa 288 

Asn lie His Arg Ala Ala Asp Arg Gin Lys He Leu Leu Val Cys Lys 

85 90 95 

aca ccc gcc gat ttc ctg acg ctg gtg aaa ggt ggc gtt ccg gtg aat 336 

Thr Pro Ala Asp Phe Leu Thr Leu Val Lys Gly Gly Val Pro Val Asn 

100 105 110 

cgc att aac gtt ggc aat atg cac tac gcc aat ggc aaa caa caa ate 384 

Arg lie Asn Val Gly Asn Met His Tyr Ala Asn Gly Lys Gin Gin lie 

115 ' 120 125 

gcc aaa acg gtt tct gtg gat gcg ggc gat ate gca gca ttt aac gac 4 32 

Ala Lys Thr Val Ser Val Asp Ala Gly Asp lie Ala Ala Phe Asn Asp 

130 135 140 

ctg aaa acc get ggg gtg gaa tgc ttc gtt cag ggc gtc ccg aca gag 4 80 

Leu Lys Thr Ala Gly Val Glu Cys Phe Val Gin Gly Val Pro Thr Glu 

145 150 155 160 

cct get gtg gac etc ttt aaa tta ctt tga 510 

Pro Ala Val Asp Leu Phe Lys Leu Leu * 
165 



<210> 211 
<211> 402 
<212> DNA 

<213> Escherichia coli 



<220> 

<221> CDS 

<222> (1) . . . (402) 

<400> 211 

atg gaa ate age ctg ttg cag gca ttt gcg ttg ggc att ate gcc ttt 48 
Met Glu lie Ser Leu Leu Gin Ala Phe Ala Leu Gly lie He Ala Phe 
15 10 15 

ate get ggc ctg gat atg ttt aac ggc eta acc cat atg cac cgc ccg 96 
He Ala Gly Leu Asp Met Phe Asn Gly Leu Thr His Met His Arg Pro 
20 25 30 

gtg gtc etc ggc ccg ttg gtc ggg ctg gta ctt ggc gat ctg cat acc 144 
Val Val Leu Gly Pro Leu Val Gly Leu Val Leu Gly Asp Leu His Thr 
35 40 45 

gga att tta acc ggc ggt acg ctg gaa ctg gtg tgg atg ggg ctg gcc 192 
Gly He Leu Thr Gly Gly Thr Leu Glu Leu Val Trp Met Gly Leu Ala 
50 55 60 
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ccg ctg gcg ggc gca cag ccg cct aac gtg att ate ggt act ate gtc 
Pro Leu Ala Gly Ala Gin Pro Pro Asn Val lie lie Gly Thr He Val 
65 70 75 80 



240 



ggc acg gcg ttt gec att act act ggc gtg aaa ccc gat gtc gca gta 288 
Gly Thr Ala Phe Ala He Thr Thr Gly Val Lys Pro Asp Val Ala Val 
85 90 95 

ggt gtc gec gta cct ttc get gtc gca gta cag atg ggg att acc ttc 336 
Gly Val Ala Val Pro Phe Ala Val Ala Val Gin Met Gly He Thr Phe 
100 105 HO 



ctg ttc teg gtg atg tec ggc gtg atg tct cgc tgc gac ctg gca aca 
Leu Phe Ser Val Met Ser Gly Val Met Ser Arg Cys Asp Leu Ala Thr 
115 120 125 



aac ccg cgc cgc att tga 
Asn Pro Arg Arg He * 
130 



384 



402 



<210> 212 
<211> 504 
<212> DNA 

<213> Escherichia coli 

<220> 

<221> CDS 

<222> (1) . . . (504) 

<400> 212 

gtg cat tgc tat aac ggg atg aca ggt tta cat cac cgc gaa ccg gga 48 

Met His Cys Tyr Asn Gly Met Thr Gly Leu His His Arg Glu Pro Gly 
15 10 15 

atg gtt ggc gcg gga tta acg gac aag cgc gec tgg ctg gaa ctg ata 96 
Met Val Gly Ala Gly Leu Thr Asp Lys Arg Ala Trp Leu Glu Leu He 
20 25 30 

gec gat ggt cat cat gtg cat ccg gcg gca atg teg ctg tgt tgt tgc 144 
Ala Asp Gly His His Val His Pro Ala Ala Met Ser Leu Cys Cys Cys 
35 40 45 

tgt gcg aaa gag aga ate gta ctg ate acc gac gcg atg cag gca get 192 
Cys Ala Lys Glu Arg He Val Leu He Thr Asp Ala Met Gin Ala Ala 
50 55 60 

ggg atg ccg gat ggt cgc tat acg tta tgt ggt gaa gaa gtg cag atg 240 
Gly Met Pro Asp Gly Arg Tyr Thr Leu Cys Gly Glu Glu Val Gin Met . 
65 70 75 80 

cac ggt ggc gtt gtc cgt acc gcg tct ggt ggg ctg gcg ggc agt acg 288 
His Gly Gly Val Val Arg Thr Ala Ser Gly Gly Leu Ala Gly Ser Thr 
85 90 95 

ctg tct gtt gat gcg gca gtg cgc aat atg gtc gag ttg acg ggc gta 336 
Leu Ser Val Asp Ala Ala Val Arg Asn Met Val Glu Leu Thr Gly Val 
100 105 110 



acg cct gcg gaa gec ate cat atg gcg teg ctg cat ccg gcg cga atg 



384 



-151- 



WO 01/34810 



PCT/USOO/30950 



Thr Pro Ala Glu Ala lie His Met Ala Ser Leu His Pro Ala Arg Met 
115 120 125 

ctg ggt gtt gat ggt gtt ctg gga teg ctt aaa ccg ggc aaa cgc gcc 432 
Leu Gly Val Asp Gly Val Leu Gly Ser Leu Lys Pro Gly Lys Arg Ala 
130 135 140 

aga gtc gtt gcg ctg gat age ggg eta cat gtg caa caa ate tgg att 4 80 

Arg Val Val Ala Leu Asp Ser Gly Leu His Val Gin Gin He Trp He 
145 150 155 160 

cag ggt caa tta get teg ttt tga 504 
Gin Gly Gin Leu Ala Ser Phe * 
165 



<210> 213 
<211> 1155 
<212> DNA 

<213> Escherichia coli 

<220> 

<221> CDS 

<222> (1) . . . (1155) 

<400> 213 

atg cca gaa aat tac ace cct get gcc gcc gca acc ggt aca tgg act 4 8 

Met Pro Glu Asn Tyr Thr Pro Ala Ala Ala Ala Thr Gly Thr Trp Thr 
15 10 15 

gaa gaa gag ate cgc cat cag cct cgc gca tgg ate cgt tea etc acc 96 
Glu Glu Glu He Arg His Gin Pro Arg Ala Trp He Arg Ser Leu Thr 
20 25 30 

aac ate gac gcg eta cgt tec gcg etc aat aac ttc ctt gaa ccg tta 144 
Asn He Asp Ala Leu Arg Ser Ala Leu Asn Asn Phe Leu Glu Pro Leu 
35 40 45 

ctg cgc aaa gag aat ctg egg ate ate ctg acc gga gcc gga acg teg 192 
Leu Arg Lys Glu Asn Leu Arg He He Leu Thr Gly Ala Gly Thr Ser 
50 55 60 

gca ttt ate ggt gac ate ate gcg ccg tgg etc gcc age cat acc ggt . 240 
Ala Phe lie Gly Asp He He Ala Pro Trp Leu Ala Ser His Thr Gly 
65 ~ 70 75 80 

aaa aac ttc age gcc gta ccg acc acc gat ctg gtc acc aat ccg atg 288 
Lys Asn Phe Ser Ala Val Pro Thr Thr Asp Leu Val Thr Asn Pro Met 
85 90 95 

gac tac ctg aac cca get cat ccg ctg ctg ttg ate tec ttc ggt cga 336 
Asp Tyr Leu Asn Pro Ala His Pro Leu Leu Leu He Ser Phe Gly Arg 
100 105 110 

tec ggc aac age ccg gaa age gtc gca gcc gtg gaa ctg gca aat caa 384 
Ser Gly Asn Ser Pro Glu Ser Val Ala Ala Val Glu Leu Ala Asn Gin 
115 120 125 

ttt gta ccg gaa tgc tat cac ctg ccg ate acc tgc aac gaa gcg ggc 432 
Phe Val Pro Glu Cys Tyr His Leu Pro He Thr Cys Asn Glu Ala Gly 



-152- 



WO 01/34810 



PCT/USOO/30950 



130 135 140 

get ctt tac caa aac gcg ate aac age gat aac gcg ttt gee ctg ctg 480 
Ala Leu Tyr Gin Asn Ala He Asn Ser Asp Asn Ala Phe Ala Leu Leu 
145 150 155 160 

atg ccc gca gaa acg cac gat cgc ggc ttt gcg atg acc age age att 528 
Met Pro Ala Glu Thr His Asp Arg Gly Phe Ala Met Thr Ser Ser He 
165 170 175 

acc acc atg atg gee age tgc etc gcg gtt ttc gca cct gag acg ate 57 6 

Thr Thr Met Met Ala Ser Cys Leu Ala Val Phe Ala Pro Glu Thr He 
180 185 190 



aac age caa acc ttc cgc gac gtg gcg gat cgt tgc cag gcg ate ctg 
Asn Ser Gin Thr Phe Arg Asp Val Ala Asp Arg Cys Gin Ala He Leu 
195 200 205 



aaa egg ate gtt tat etc ggt age ggt ggc tta cag ggc gca gca cgc 
Lys Arg He Val Tyr Leu Gly Ser Gly Gly Leu Gin Gly Ala Ala Arg 
225 230 235 240 



624 



acc tea ctg ggc gat ttc age gaa ggt gtg ttt ggt tac gca ccg tgg 672 
Thr Ser Leu Gly Asp Phe Ser Glu Gly Val Phe Gly Tyr Ala Pro Trp 
210 215 220 



720 



gag teg gcg ctg aaa gtg ctg gaa ctg acg gcg ggt aaa ctg gcg gec 768 
Glu Ser Ala Leu Lys Val Leu Glu Leu Thr Ala Gly Lys Leu Ala Ala 
245 250 255 

ttt tat gat tct cca acc gga ttc cgt cat gga cca aaa teg ctg gtc 816 
Phe Tyr Asp Ser Pro Thr Gly Phe Arg His Gly Pro Lys Ser Leu Val 
260 265 270 

gat gac gaa acg ctg gtg gtg gta ttt gtc tec age cac cct tac acc 864 
Asp Asp Glu Thr Leu Val Val Val Phe Val Ser Ser His Pro Tyr Thr 
275 280 285 

cgt cag tat gat ctt gat ctg ctg get gaa ctt cgc cgt gac aac cag 912 
Arg Gin Tyr Asp Leu Asp Leu Leu Ala Glu Leu Arg Arg Asp Asn Gin 
290 295 300 

gca atg cgt gta ate gee ate gee gcg gaa age age gac ate gtc get 960 
Ala Met Arg Val lie Ala He Ala Ala Glu Ser Ser Asp He Val Ala 
305 310 315 320 

gee ggt cca cat ate ate ctg cca ccg tea cgt cac ttt ate gac gtt 1008 
Ala Gly Pro His He He Leu Pro Pro Ser Arg His Phe He Asp Val 
325 330 335 

gag cag gca ttt tgc ttc ctg atg tac gee cag acg ttt gca ctg atg 1056 
Glu Gin Ala Phe Cys Phe Leu Met Tyr Ala Gin Thr Phe Ala Leu Met 
340 345 350 

cag teg ctg cac atg ggc aat acg ccg gat acc cca tea gee agt ggc 1104 
Gin Ser Leu His Met Gly Asn Thr Pro Asp Thr Pro Ser Ala Ser Gly 
355 ~ 360 365 

acc gtt aac cgc gtg gtg caa ggc gta ate att cat ccg tgg cag gca 1152 
Thr Val Asn Arg Val Val Gin Gly Val He He His Pro Trp Gin Ala 
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370 375 380 

taa 1155 



<210> 214 
<211> 861 
<212> DNA 

<213> Escherichia coli 

<220> 

<221> CDS 

<222> (1) . . . (861) 

<400> 214 

atg age att ate tec act aaa tat ctg tta cag gac gec cag gec aat 4 8 

Met Ser lie He Ser Thr Lys Tyr Leu Leu Gin Asp Ala Gin Ala Asn 
1 5 10 15 

ggc tac gcg gtg cct get ttt aac att cat aac gec gag acg ate caa 96 
Gly Tyr Ala Val Pro Ala Phe Asn He His Asn Ala Glu Thr He Gin 
20 25 30 

gcg ate etc gaa gtg tgc agt gaa atg cga teg ccg gtg ate etc gec 144 
Ala He Leu Glu Val Cys Ser Glu Met Arg Ser Pro Val He Leu Ala 
35 40 45 

gga acg ccg ggg ace ttt aaa cac ate gcg ctg gaa gag ate tac gec 192 
Gly Thr Pro Gly Thr Phe Lys His He Ala Leu Glu Glu He Tyr Ala 
50 55 60 



ctg tgt age gec tat tec aca ace tac aac atg cca ctg gcg ctg cat 
Leu Cys Ser Ala Tyr Ser Thr Thr Tyr Asn Met Pro Leu Ala Leu His 
65 70 75 80 



atg age gtt gac gec gaa agt gca ttc ctg ace gat cca caa gaa get 
Met Ser Val Asp Ala Glu Ser Ala Phe Leu Thr Asp Pro Gin Glu Ala 
145 150 155 160 



240 



etc gac cac cac gaa teg ctg gat gat att cgc cgt aaa gtc cac gca 288 

Leu Asp His His Glu Ser Leu Asp Asp He Arg Arg Lys Val His Ala 
85 90 95 

ggt gtg cgc agt gcg atg ate gac ggc age cac ttc ccg ttt gee gag 336 

Gly Val Arg Ser Ala Met He Asp Gly Ser His Phe Pro Phe Ala Glu 
100 105 110 

aac gtg aag ctg gtg aaa teg gtt gtt gac ttc tgc cac tea caa gat 384 

Asn Val Lys Leu Val Lys Ser Val Val Asp Phe Cys His Ser Gin Asp 
115 120 125 

tgc age gtg gaa gca gaa ctg ggc cgc ctg ggc ggt gtt gaa gat gac 432 

Cys Ser Val Glu Ala Glu Leu Gly Arg Leu Gly Gly Val Glu Asp Asp 

130 135 140 



480 



aaa cgc ttt gtc gaa ctg act ggc gtc gac age ctg gcg gta gcg att 528 
Lys Arg Phe Val Glu Leu Thr Gly Val Asp Ser Leu Ala Val Ala He 
165 170 175 
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ggt acg gcg cac ggc tta tac age aaa acg ccg aag att gat ttc cag 576 

Gly Thr Ala His Gly Leu Tyr Ser Lys Thr Pro Lys lie Asp Phe Gin 

180 185 190 

egg ctg gcg gaa att cgt gaa gtg gtg gat gtt cct ctg gtg ctg cat 624 

Arg Leu Ala Glu lie Arg Glu Val Val Asp Val Pro Leu Val Leu His 

195 200 205 

ggt gec age gat gtt ccg gat gaa ttt gtc cgt cgc act att gaa ctt 672 

Gly Ala Ser Asp Val Pro Asp Glu Phe Val Arg Arg Thr He Glu Leu 

210 215 220 

ggc gtc aca aaa gtg aac gtt gec aca gaa tta aaa ata gec ttc get 720 

Gly Val Thr Lys Val Asn Val Ala Thr Glu Leu Lys He Ala Phe Ala 

225 230 235 240 

ggc gcg gtt aaa gee tgg ttt gcg gaa aat ccg cag ggt aat gat cct 768 

Gly Ala Val Lys Ala Trp Phe Ala Glu Asn Pro Gin Gly Asn Asp Pro 
245 250 255 

cgt tat tat atg cgc gtc gga atg gat gcg atg aaa gaa gtt gtc aga 816 

Arg Tyr Tyr Met Arg Val Gly Met Asp Ala Met Lys Glu Val Val Arg 

260 265 270 

aat aaa att aat gtc tgt ggt tea gcg aat cga att tea gca taa 8 61 

Asn Lys lie Asn Val Cys Gly Ser Ala Asn Arg He Ser Ala * 

275 280 285 



<210> 215 
<211> 357 
<212> DNA 

<213> Escherichia coli 



<220> 

<221> CDS 

<222> (1) . . . (357) 

<400> 215 

gtg gec cgt ata gca ggc att aac att cct gat cat aag cat gee gta 48 

Met Ala Arg He Ala Gly He Asn He Pro Asp His Lys His Ala Val 

15 10 15 

ate gca tta act teg att tat ggc gtc ggc aag ace cgt tct aaa gec 96 
He Ala Leu Thr Ser He Tyr Gly Val Gly Lys Thr Arg Ser Lys Ala 
20 25 30 

ate ctg get gca gcg ggt ate get gaa gat gtt aag ate agt gag ctg 14 4 

He Leu Ala Ala Ala Gly He Ala Glu Asp Val Lys He Ser Glu Leu 
35 40 45 

tct gaa gga caa ate gac acg ctg cgt gac gaa gtt gec aaa ttt gtc 192 
Ser Glu Gly Gin He Asp Thr Leu Arg Asp Glu Val Ala Lys Phe Val 
50 .55 60 

gtt gaa ggt gat ctg cgc cgt gaa ate age atg age ate aag cgc ctg 240 
Val Glu Gly Asp Leu Arg Arg Glu He Ser Met Ser He Lys Arg Leu 
65 70 -75 80 
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atg gat ctt ggt tgc tat cgc ggt ttg cgt cat cgt cgt ggt etc ccg 288 

Met Asp Leu Gly Cys Tyr Arg Gly Leu Arg His Arg Arg Gly Leu Pro 

85 90 95 

gtt cgc ggt cag cgt acc aag acc aac gca cgt acc cgt aag ggt ccg 336 

Val Arg Gly Gin Arg Thr Lys Thr Asn Ala Arg Thr Arg Lys Gly Pro 

100 105 110 

cgc aaa ccg ate aag aaa taa 357 

Arg Lys Pro He Lys Lys * 
115 



<210> 216 
<211> 390 
<212> DNA 

<213> Escherichia coli 

<220> 

<221> CDS 

<222> (1) . . . (390) 

<400> 216 

atg gca aag gca cca att cgt gca cgt aaa cgt gta aga aaa caa gtc 4 8 

Met Ala Lys Ala Pro He Arg Ala Arg Lys Arg Val Arg Lys Gin Val 

15 10 15 

tct gac ggc gtg get cat ate cat get tct ttc aac aac acc ate gtg 96 
Ser Asp Gly Val Ala His He His Ala Ser Phe Asn Asn Thr He Val 
20 25 30 

act ate act gat cgt cag ggt aac gcg ttg ggt tgg gca aca gee ggt 14 4 

Thr He Thr Asp Arg Gin Gly Asn Ala Leu Gly Trp Ala Thr Ala Gly 
35 40 45 

ggt tec ggt ttc cgt ggt tct cgc aaa tec act ccg ttt gca get cag 192 
Gly Ser Gly Phe Arg Gly Ser Arg Lys Ser Thr Pro Phe Ala Ala Gin 
50 55 60 

gtt gca gca gag cgt tgc get gac gee gtg aaa gaa tac ggc ate aag 24 0 

Val Ala Ala Glu Arg Cys Ala Asp Ala Val Lys Glu Tyr Gly He Lys 
65 70 75 80 

aat ctg gaa gtt atg gtt aaa ggt ccg ggt cca ggc cgc gaa tct act 288 
Asn Leu Glu Val Met Val Lys Gly Pro Gly Pro Gly Arg Glu Ser Thr 
85 90 95 

att cgt get ctg aac gee gca ggt ttc cgc ate act aac att act gat 336 
He Arg Ala Leu Asn Ala Ala Gly Phe Arg He Thr Asn He Thr Asp 
100 105 110 

gtg act ccg ate cct cat aac ggt tgt cgt ccg ccg aaa aaa cgt cgc 384 
Val Thr Pro He Pro His Asn Gly Cys Arg Pro Pro Lys Lys Arg Arg 
115 120 125 

gta taa 390 
Val * 
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<210> 217 
<211> 621 
<212> DNA 

<213> Escherichia coli 

<220> 

<221> CDS 

<222> (1) . . . (621) 

<400> 217 

atg gca aga tat ttg ggt cct aag etc aag ctg age cgt cgt gag ggc 

Met Ala Arg Tyr Leu Gly Pro Lys Leu Lys Leu Ser Arg Arg Glu Gly 

1 5 10 15 

acc gac tta ttc ctt aag tct ggc gtt cgc gcg ate gat acc aag tgt 
Thr Asp Leu Phe Leu Lys Ser Gly Val Arg Ala lie Asp Thr Lys Cys 
20 25 30 



ctg get gag cag cgt gaa aag cca acc tgg ctg gaa gtt gat get ggc 
Leu Ala Glu Gin Arg Glu Lys Pro Thr Trp Leu Glu Val Asp Ala Gly 
165 170 175 



48 



96 



aaa att gaa caa get cct ggc cag cac ggt gcg cgt aaa ccg cgt ctg 144 
Lys He Glu Gin Ala Pro Gly Gin His Gly Ala Arg Lys Pro Arg Leu 
35 40 45 

tct gac tat ggt gtg cag ttg cgt gaa aag caa aaa gtt cgc cgt ate 192 
Ser Asp Tyr Gly Val Gin Leu Arg Glu Lys Gin Lys Val Arg Arg He 
50 55 60 

tat ggt gtg ctg gag cgt cag ttc cgt aac tac tac aaa gaa gca gca 240 
Tyr Gly Val Leu Glu Arg Gin Phe Arg Asn Tyr Tyr Lys Glu Ala Ala 
65 70 75 80 

cgt ctg aaa ggc aac acc ggt gaa aac ctg ttg get ctg ctg gaa ggt 
Arg Leu Lys Gly Asn Thr Gly Glu Asn Leu Leu Ala Leu Leu Glu Gly 
85 90 95 

cgt ctg gac aac gtt gta tac cgt atg ggc ttc ggt gee act cgt gca 
Arg Leu Asp Asn Val Val Tyr Arg Met Gly Phe Gly Ala Thr Arg Ala 
100 105 HO 

gaa gca cgt cag ctg gtt age cat aaa gca att atg gta aac ggt cgt 
Glu Ala Arg Gin Leu Val Ser His Lys Ala He Met Val Asn Gly Arg 
115 120 125 

gtt gtt aac ate get tct tat cag gtt agt ccg aat gac gtt gta age 
Val Val Asn He Ala Ser Tyr Gin Val Ser Pro Asn Asp Val Val Ser 
130 135 140 

att cgt gag aaa gcg aag aag cag tct cgc gtg aaa gec get ctg gag 480 
He Arg Glu Lys Ala Lys Lys Gin Ser Arg Val Lys Ala Ala Leu Glu 
145 " 150 155 160 



288 



336 



384 



432 



528 



aag atg gaa ggt acg ttt aag cgt aag ccg gag cgt tct gat ctg tct 57 6 

Lys Met Glu Gly Thr Phe Lys Arg Lys Pro Glu Arg Ser Asp Leu Ser 
180 185 190 

gcg gac att aac gaa cac ctg ate gtc gag ctt tac tec aag taa 621 

Ala Asp He Asn Glu His Leu He Val Glu Leu Tyr Ser Lys * 
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195 200 205 



<210> 218 
<211> 990 
<212> DNA 

<213> Escherichia coli 

<220> 

<221> CDS 

<222> (1) . . . (990) 

<400> 218 

atg cag ggt tct gtg aca gag ttt eta aaa ccg cgc ctg gtt gat ate 48 

Met Gin Gly Ser Val Thr Glu Phe Leu Lys Pro Arg Leu Val Asp lie 

15 10 15 

gag caa gtg agt teg acg cac gee aag gtg ace ctt gag cct tta gag 96 
Glu Gin Val Ser Ser Thr His Ala Lys Val Thr Leu Glu Pro Leu Glu 
20 25 30 

cgt ggc ttt ggc cat act ctg ggt aac gca ctg cgc cgt att ctg etc 144 
Arg Gly Phe Gly His Thr Leu Gly Asn Ala Leu Arg Arg lie Leu Leu 
35 4 0 4 5 

tea teg atg ccg ggt tgc gcg gtg ace gag gtt gag att gat ggt gta 192 
Ser Ser Met Pro Gly Cys Ala Val Thr Glu Val Glu lie Asp Gly Val 
50 55 60 

eta cat gag tac age acc aaa gaa ggc gtt cag gaa gat ate ctg gaa 240 
Leu His Glu Tyr Ser Thr Lys Glu Gly Val Gin Glu Asp lie Leu Glu 
65 70 75 80 

ate ctg etc aac ctg aaa ggg ctg gcg gtg aga gtt cag ggc aaa gat 288 
lie Leu Leu Asn Leu Lys Gly Leu Ala Val Arg Val Gin Gly Lys Asp 
85 90 95 

gaa gtt att ctt acc ttg aat aaa tct ggc att ggc cct gtg act gca 336 
Glu Val lie Leu Thr Leu Asn Lys Ser Gly He Gly Pro Val Thr Ala 
100 105 110 

gee gat ate acc cac gac ggt gat gtc gaa ate gtc aag ccg cag cac 384 
Ala Asp He Thr His Asp Gly Asp Val Glu He Val Lys Pro Gin His 
115 120 125 

gtg ate tgc cac ctg acc gat gag aac gcg tct att age atg cgt ate 432 
Val He Cys His Leu Thr Asp Glu Asn Ala Ser He Ser Met Arg He 
130 135 140 

aaa gtt cag cgc ggt cgt ggt tat gtg ccg get tct acc cga att cat 480 
Lys Val Gin Arg Gly Arg Gly Tyr Val Pro Ala Ser Thr Arg He His 
145 150 155 160 

teg gaa gaa gat gag cgc cca ate ggc cgt ctg ctg gtc gac gca tgc 528 
Ser Glu Glu Asp Glu Arg Pro He Gly Arg Leu Leu Val Asp Ala Cys 
165 170 175 

tac age cct gtg gag cgt att gee tac aat gtt gaa gca gcg cgt gta 576 
Tyr Ser Pro Val Glu Arg He Ala Tyr Asn Val Glu Ala Ala Arg Val 
180 185 190 
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gaa cag cgt acc gac ctg gac aag ctg gtc ate gaa atg gaa acc aac 
Glu Gin Arg Thr Asp Leu Asp Lys Leu Val He Glu Met Glu Thr Asn 
195 200 205 

ggc aca ate gat cct gaa gag gcg att cgt cgt gcg gca acc att ctg 
Gly Thr He Asp Pro Glu Glu Ala He Arg Arg Ala Ala Thr He Leu 
210 215 220 

get gaa caa ctg gaa get ttc gtt gac tta cgt gat gta cgt cag cct 
Ala Glu Gin Leu Glu Ala Phe Val Asp Leu Arg Asp Val Arg Gin Pro 
225 230 235 240 

gaa gtg aaa gaa gag aaa cca gag ttc gat ccg ate ctg ctg cgc cct 
Glu Val Lys Glu Glu Lys Pro Glu Phe Asp Pro He Leu Leu Arg Pro 
245 250 255 

gtt gac gat ctg gaa ttg act gtc cgc tct get aac tgc ctt aaa gca 
Val Asp Asp Leu Glu Leu Thr Val Arg Ser Ala Asn Cys Leu Lys Ala 
260 265 270 

gaa get ate cac tat ate ggt gat ctg gta cag cgt acc gag gtt gag 
Glu Ala He His Tyr He Gly Asp Leu Val Gin Arg Thr Glu Val Glu 
275 ' 280 285 

etc ctt aaa acg cct aac ctt ggt aaa aaa tct ctt act gag att aaa 
Leu Leu Lys Thr Pro Asn Leu Gly Lys Lys Ser Leu Thr Glu He Lys 
290 295 300 

gac gtg ctg get tec cgt gga ctg tct ctg ggc atg cgc ctg gaa aac 
Asp Val Leu Ala Ser Arg Gly Leu Ser Leu Gly Met Arg Leu Glu Asn 
305 310 315 320 

tgg cca ccg gca age ate get gac gag taa 
Trp Pro Pro Ala Ser He Ala Asp Glu * 
325 



<210> 219 
<211> 384 
<212> DNA 

<213> Escherichia coli 

<220> 

<221> CDS 

<222> (1) . . . (384) 

<400> 219 

atg cgc cat cgt aag agt ggt cgt caa ctg aac cgc aac age age cat 

Met Arg His Arg Lys Ser Gly Arg Gin Leu Asn Arg Asn Ser Ser His 
1 5 10 15 

cgc cag get atg ttc cgc aat atg gca ggt tea ctg gtt cgt cat gaa 
Arg Gin Ala Met Phe Arg Asn Met Ala Gly Ser Leu Val Arg His Glu 
20 25 30 

ate ate aag acg act ctg cct aaa gcg aaa gag ctg cgc cgc gta gtt 
He He Lys Thr Thr Leu Pro Lys Ala Lys Glu Leu Arg Arg Val Val 
35 40 45 



624 



672 



720 



7 68 



816 



864 



912 



960 



990 



48 



96 



144 
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gag ccg ctg att act ctt gcc aag act gat age gtt get aat cgt cgt 192 

Glu Pro Leu He Thr Leu Ala Lys Thr Asp Ser Val Ala Asn Arg Arg 

50 55 60 

ctg gca ttc gcc cgt act cgt gat aac gag ate gtg gca aaa ctg ttt 24 0 

Leu Ala Phe Ala Arg Thr Arg Asp Asn Glu He Val Ala Lys Leu Phe 

65 " 70 75 80 

aac gaa ctg ggc ccg cgt ttc gcg age cgt gcc ggt ggt tac act cgt 288 

Asn Glu Leu Gly Pro Arg Phe Ala Ser Arg Ala Gly Gly Tyr Thr Arg 

85 90 95 

att ctg aag tgt ggc ttc cgt gca ggc gac aac gcg ccg atg get tac 336 

He Leu Lys Cys Gly Phe Arg Ala Gly Asp Asn Ala Pro Met Ala Tyr 

100 105 110 

ate gag ctg gtt gat cgt tea gag aaa gca gaa get get gca gag taa 384 

He Glu Leu Val Asp Arg Ser Glu Lys Ala Glu Ala Ala Ala Glu * 

115 120 125 



<210> 220 
<211> 732 
<212> DNA 

<213> Escherichia coli 

<220> 

<221> CDS 

<222> (1) . . . (732) 

<400> 220 

atg aag atg ttg cgc gat ccg ctg ttc tgg etc att get ctg ttt gtg 48 

Met Lys Met Leu Arg Asp Pro Leu Phe Trp Leu He Ala Leu Phe Val 

1 5 10 15 

gcg ctg att ttc tgg ctg cct tac age cag ccg ctg ttt get gcc ttg 96 
Ala Leu He Phe Trp Leu Pro Tyr Ser Gin Pro Leu Phe Ala Ala Leu 
20 * 25 30 

ttc cca caa ctg cca cga ccc gtt tat cag caa gaa agt ttt gca get 144 
Phe Pro Gin Leu Pro Arg Pro Val Tyr Gin Gin Glu Ser Phe Ala Ala 
35 40 45 

ctg gca ctg get cat ttc tgg ctg gtg gga att teg agt ttg ttt gcg 192 
Leu Ala Leu Ala His Phe Trp Leu Val Gly He Ser Ser Leu Phe Ala 
50 55 60 

gtg ate att ggc act ggt gcc gga att get gtc act cgc ccg tgg ggc 240 
Val He lie Gly Thr Gly Ala Gly He Ala Val Thr Arg Pro Trp Gly 
65 70 75 80 

gcg gaa ttt cgc cca ctg gtg gaa act att gcc gcc gtt gga cag act 288 
Ala Glu Phe Arg Pro Leu Val Glu Thr He Ala Ala Val Gly Gin Thr 
85 90 95 

ttt ccg ccc gtc gca gtg ctg gcg ate gcc gtt ccg gtg ate ggc ttt 336 
Phe Pro Pro Val Ala Val Leu Ala He Ala Val Pro Val He Gly Phe 
100 105 110 

ggt ctg caa cca gcg att ate gcc ttg ate ctt tac ggt gtg ctg ccc 384 
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Gly Leu Gin Pro Ala lie lie Ala Leu He Leu Tyr Gly Val Leu Pro 
115 120 125 

gtc ctg cag gcg aca ctt gcc ggg ctg gga gcg att gat gcc age gtg 432 
Val Leu Gin Ala Thr Leu Ala Gly Leu Gly Ala He Asp Ala Ser Val 
130 135 140 



aca gaa gtt gcg aaa ggt atg gga atg agt cgt ggt cag cga gtg cgt 
Thr Glu Val Ala Lys Gly Met Gly Met Ser Arg Gly Gin Arg Val Arg 
145 150 155 160 

aag gtc gag eta ccg ctg gcg get ccg gtg att ctg gcg ggc gtg cga 
Lys Val Glu Leu Pro Leu Ala Ala Pro Val He Leu Ala Gly Val Arg 
165 170 175 

act teg gtg att ate aac att ggt acg gcg acg ate gcc tea acg gta 
Thr Ser Val He He Asn He Gly Thr Ala Thr He Ala Ser Thr Val 
180 185 190 

ggg gcc age acg ctg ggt acg ccc ate ate ate ggg ctt age gga ttt 
Gly Ala Ser Thr Leu Gly Thr Pro He He He Gly Leu Ser Gly Phe 
195 200 205 

aat acc gcg tat gtg ate cag ggg gcg tta ctg gtg gca ctg gcg gcg 
Asn Thr Ala Tyr Val He Gin Gly Ala Leu Leu Val Ala Leu Ala Ala 
210 J 215 220 

ate ate gca gac cgc ctg ttt gaa agg ctg gtg cag gcg ctt age cag 
He He Ala Asp Arg Leu Phe Glu Arg Leu Val Gin Ala Leu Ser Gin 
225 230 235 240 

cac gca aaa taa 
His Ala Lys * 



<210> 221 
<211> 927 
<212> DNA 

<213> Escherichia coli 

<220> 

<221> CDS 

<222> (1) . . . (927) 

<400> 221 

atg att gaa ttt age cat gtc age aaa ctg ttc ggc gca caa aaa gcc 
Met He Glu Phe Ser His Val Ser Lys Leu Phe Gly Ala Gin Lys Ala 
15 10 15 

gtt aac gat etc aat etc aat ttt cag gaa ggg agt ttt teg gtg ctg 
Val Asn Asp Leu Asn Leu Asn Phe Gin Glu Gly Ser Phe Ser Val Leu 
20 25 30 



480 



528 



576 



624 



672 



720 



732 



48 



96 



att ggc aca tct ggc tec ggc aaa tec acc acc ctg aaa atg att aac 144 
He Gly Thr Ser Gly Ser Gly Lys Ser Thr Thr Leu Lys Met He Asn 
35 40 45 

cgc ctg gtg gag cat gac age gga gag ate cgc ttt gcc gga gaa gaa 192 
Arg Leu Val Glu His Asp Ser Gly Glu He Arg Phe Ala Gly Glu Glu 
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50 55 60 

att cgc teg ctg cca gta ctg gag ttg cgc cgc egg atg ggc tat gee 240 
lie Arg Ser Leu Pro Val Leu Glu Leu Arg Arg Arg Met Gly Tyr Ala 
65 70 75 80 

att caa tct att ggc ctg ttc ccc cac tgg age gtg gcg caa aac ate 288 
lie Gin Ser lie Gly Leu Phe Pro His Trp Ser Val Ala Gin Asn lie 
85 90 95 

get acc gtg ccg caa tta caa aaa tgg teg egg gcg egg att gac gat 336 
Ala Thr Val Pro Gin Leu Gin Lys Trp Ser Arg Ala Arg He Asp Asp 
100 105 110 

cgt ate gac gaa tta atg gcg eta ctg ggg ctg gag tea aat ttg cgt 384 
Arg He Asp Glu Leu Met Ala Leu Leu Gly Leu Glu Ser Asn Leu Arg 
115 120 125 



gag cgt tat ccg cat cag ctt tec ggt ggt cag cag caa cgt gtg gga 
Glu Arg Tyr Pro His Gin Leu Ser Gly Gly Gin Gin Gin Arg Val Gly 
130 135 140 



cgt ccg gcg aat gat ttt gtc cgc cag ttt ttt gga cgt agt gaa ctg 
Arg Pro Ala Asn Asp Phe Val Arg Gin Phe Phe Gly Arg Ser Glu Leu 
225 230 235 240 



gaa gaa cga gca gat ggt gag gca ctg gca gaa gag atg acg eta cgc 
Glu Glu Arg Ala Asp Gly Glu Ala Leu Ala Glu Glu Met Thr Leu Arg 
260 265 270 



432 



gtg gcg cgt gca ctg get gee gat ccg caa gtc tta eta atg gat gaa 480 

Val Ala Arg Ala Leu Ala Ala Asp Pro Gin Val Leu Leu Met Asp Glu 
145 150 155 160 

cct ttt ggc gca ctg gac ccg gta acg cgc ggc gcg ttg caa caa gag 528 

Pro Phe Gly Ala Leu Asp Pro Val Thr Arg Gly Ala Leu Gin Gin Glu 
165 170 175 

atg acg cgc att cac cgt ttg ctg ggg cgt acc att gtg ctg gtc act 576 

Met Thr Arg He His Arg Leu Leu Gly Arg Thr He Val Leu Val Thr 

180 185 190 

cat gat att gat gag gcg eta egg ctg gca gaa cat ctg gta ttg atg 624 

His Asp He Asp Glu Ala Leu Arg Leu Ala Glu His Leu Val Leu Met 

195 200 205 

gat cac ggt gaa gta gtg cag cag ggc aat ccg ctg acg atg ctg act 672 

Asp His Gly Glu Val Val Gin Gin Gly Asn Pro Leu Thr Met Leu Thr 
210 215 220 



720 



ggt gtg cgc ctg ctt teg tta cgt agt gtg gcg gat tac gtg cgt cgc 768 
Gly Val Arg Leu Leu Ser Leu Arg Ser Val Ala Asp Tyr Val Arg Arg 
245 250 255 



816 



gat gcg etc tct ctg ttt gtt gcg egg gga tgc gag gtg ctg ccg gtg 864 
Asp Ala Leu Ser Leu Phe Val Ala Arg Gly Cys Glu Val Leu Pro Val 
275 280 285 

gtg aac atg cag ggc cag cct tgc ggc acg ctg cat ttt cag gat ctg 912 
Val Asn Met Gin Gly Gin Pro Cys Gly Thr Leu His Phe Gin Asp Leu 
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290 



295 



ctg gtg gag gcg taa 
Leu Val Glu Ala * 
305 



300 



927 



<210> 222 
<211> 1158 
<212> DNA 

<213> Escherichia coli 

<220> 

<221> CDS 

<222> (1) . . . (1158) 

<400> 222 

gtg act tat ttc cgt att aat cct gtt ctg gcg ctg ctg ctg ttg ctg 

Met Thr Tyr Phe Arg lie Asn Pro Val Leu Ala Leu Leu Leu Leu Leu 
1 " 5 10 15 



att ccc ggt aaa aaa ggc age att tgt gca etc att ctg gcg caa ttc 
lie Pro Gly Lys Lys Gly Ser lie Cys Ala Leu lie Leu Ala Gin Phe 
65 70 75 80 

gtc ttc gta ttg ctg gtg tgg gga get gga aag gcg gcg acc caa ctg 
Val Phe Val Leu Leu Val Trp Gly Ala Gly Lys Ala Ala Thr Gin Leu 
85 90 95 



gec att att ccg ctg tgg ttg ctg tac tec ggc acg ctt aac gat etc 
Ala He He Pro Leu Trp Leu Leu Tyr Ser Gly Thr Leu Asn Asp Leu 
145 150 155 160 



48 



acg gca ate gca gcg gcg ctg ccg ttt ate agt tac gcg cct aat cgt 96 
Thr Ala He Ala Ala Ala Leu Pro Phe He Ser Tyr Ala Pro Asn Arg 
20 25 30 

tta gtt teg ggt gag ggg cgt cat etc tgg cag ctg tgg ccg caa acg 14 4 

Leu Val Ser Gly Glu Gly Arg His Leu Trp Gin Leu Trp Pro Gin Thr 
35 40 45 

ate tgg atg ctg gtg ggc gtt ggt tgc gec tgg ctg acg gee tgt ttt 192 
He Trp Met Leu Val Gly Val Gly Cys Ala Trp Leu Thr Ala Cys Phe 
50 55 60 



240 



288 



gcg caa aat ggc agt gcg ctg gcg cgt acc age etc ggc agt ggt ttc 336 
Ala Gin Asn Gly Ser Ala Leu Ala Arg Thr Ser Leu Gly Ser Gly Phe 
100 105 HO 

tgg ctg get gcg gcg ctg gca ttg ctg gec tgt age gat gec ate cgc 384 
Trp Leu Ala Ala Ala Leu Ala Leu Leu Ala Cys Ser Asp Ala He Arg 
115 120 125 

cga ate tec acg cat ccg ctg tgg cgc tgg ttg ttg cat atg cag att 4 32 

Arg He Ser Thr His Pro Leu Trp Arg Trp Leu Leu His Met Gin He 
130 135 .140 



480 



tea eta atg aaa gaa tac gee aac cgt cag gat gtg ttt gac gac gcg 528 
Ser Leu Met Lys Glu Tyr Ala Asn Arg Gin Asp Val Phe Asp Asp Ala 
165 ' 170 175 
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ctg gca caa cat ctg acg ttg ctg ttt ggt gcg gtg ctg cct gcg tta 57 6 

Leu Ala Gin His Leu Thr Leu Leu Phe Gly Ala Val Leu Pro Ala Leu 
180 185 190 

gtg att ggt gtg ccg ttg ggc ate tgg tgc tac ttt tec act get egg 624 

Val lie Gly Val Pro Leu Gly lie Trp Cys Tyr Phe Ser Thr Ala Arg 
195 200 205 

cag ggg gca att ttt tct ctg etc aat gtg att cag ace gtg cct teg 672 

Gin Gly Ala lie Phe Ser Leu Leu Asn Val lie Gin Thr Val Pro Ser 

210 215 220 

gtg gcg etc ttt ggc ctg ttg att gcg ccg ctt gee gcg ctg gtt acg 720 

Val Ala Leu Phe Gly Leu Leu lie Ala Pro Leu Ala Ala Leu Val Thr 

225 230 235 240 

gee ttt ccg tgg ctg ggg acg etc ggc ata gca gga ace gga atg aca 7 68 

Ala Phe Pro Trp Leu Gly Thr Leu Gly lie Ala Gly Thr Gly Met Thr 
245 250 255 

ccc gca ctg att gcg ctg gtg etc tat gee ttg ctg ccg ctg gtg cgc 816 

Pro Ala Leu lie Ala Leu Val Leu Tyr Ala Leu Leu Pro Leu Val Arg 
260 265 270 

ggc gtg gta gtc ggt ttg aac cag ate ccg cgc gat gtg ctg gaa age 8 64 

Gly Val Val Val Gly Leu Asn Gin lie Pro Arg Asp Val Leu Glu Ser 
275 "* 280 285 

gee aga gcg atg ggg atg age ggg gcg cag cgt ttc ctg cat gtt cag 912 

Ala Arg Ala Met Gly Met Ser Gly Ala Gin Arg Phe Leu His Val Gin 

290 295 300 

tta ccg ctg gcg tta ccg gta ttt ctg cgc age ctg egg gtg gtg atg 960 

Leu Pro Leu Ala Leu Pro Val Phe Leu Arg Ser Leu Arg Val Val Met 

305 310 315 ' 320 

gtg caa act gta ggt atg gcg gtg att gcg gcg tta ate ggc gca ggc 1008 

Val Gin Thr Val Gly Met Ala Val lie Ala Ala Leu He Gly Ala Gly 
325 330 335 

ggt ttt ggt gcg ctg gtt ttc cag ggg ctg eta age age gee att gat 1056 

Gly Phe Gly Ala Leu Val Phe Gin Gly Leu Leu Ser Ser Ala He Asp 
340 345 350 

tta gtg ttg ctg ggg gtg ate ccg gta att gtt ctg gcg gtg ctt ace 1104 

Leu Val Leu Leu Gly Val He Pro Val He Val Leu Ala Val Leu Thr 
355 360 365 

gac gcg ctg ttc gat ttg ctt ate gca ctg ctg aag gtg aaa cgt aat 1152 
Asp Ala Leu Phe Asp Leu Leu He Ala Leu Leu Lys Val Lys Arg Asn 

370 375 380 

gat tga 1158 

Asp * 

385 



<210> 223 
<211> 918 



-164- 



WO 01/34810 



PCT/US00/30950 



<212> DNA 

<213> Escherichia coli 

<220> 

<221> CDS 

<222> (1) . . . (918) 

<400> 223 

atg cca etc tta aag etc tgg gca ggt tea ctg gtt atg ttg gca gec 

Met Pro Leu Leu Lys Leu Trp Ala Gly Ser Leu Val Met Leu Ala Ala 

1 5 10 15 

gtg age ctg ccg eta cag gcg get tec ccc gtt aaa gtc ggt tea aaa 
Val Ser Leu Pro Leu Gin Ala Ala Ser Pro Val Lys Val Gly Ser Lys 
20 25 30 



teg gag cac aac aag tta ate tgg ctg acg ccc gcg cct gca aat aac 
Ser Glu His Asn Lys Leu lie Trp Leu Thr Pro Ala Pro Ala Asn Asn 
115 120 125 



48 



96 



ate gat ace gaa ggt gcg eta etc ggc aat ate att ttg cag gtg ctg 144 
lie Asp Thr Glu Gly Ala Leu Leu Gly Asn He He Leu Gin Val Leu 
35 40 45 

gaa age cac ggc gta ccg acg gtg aat aaa gtg caa etc gga acg act 192 
Glu Ser His Gly Val Pro Thr Val Asn Lys Val Gin Leu Gly Thr Thr 
50 55 .60 

cct gtg gtg cgc ggg gcg att act tec ggt gaa ctg gat ate tat ccg 240 
Pro Val Val Arg Gly Ala He Thr Ser Gly Glu Leu Asp He Tyr Pro 
65 70 75 80 

gaa tat ace ggc aat ggc get ttc ttc ttt aaa gat gaa aac gat gca 288 
Glu Tyr Thr Gly Asn Gly Ala Phe Phe Phe Lys Asp Glu Asn Asp Ala 
85 90 95 

gcg tgg aaa aac gcg cag caa ggt tac gag aaa gtc aaa aag etc gat 336 
Ala Trp Lys Asn Ala Gin Gin Gly Tyr Glu Lys Val Lys Lys Leu Asp 
100 105 HO 



384 



ace tgg act ate gee gtg cgt cag gat gtg gca gag aaa aat aaa etc 432 
Thr Trp Thr He Ala Val Arg Gin Asp Val Ala Glu Lys Asn Lys Leu 
130 135 140 

act teg ctt gee gac ctg agt cgt tat ctg caa gag ggc ggc ace ttc 480 
Thr Ser Leu Ala Asp Leu Ser Arg Tyr Leu Gin Glu Gly Gly Thr Phe 
145 150 - 155 - 160 

aaa ctg gca gee teg gca gag ttt ate gaa cgc gee gat gcg tta ccc 528 
Lys Leu Ala Ala Ser Ala Glu Phe He Glu Arg Ala Asp Ala Leu Pro 
165 170 - 175 

gcg ttt gaa aaa gee tat ggc ttt aag etc ggt cag gat cag ttg ctg 576 
Ala Phe Glu Lys Ala Tyr Gly Phe Lys Leu Gly Gin Asp Gin Leu Leu 
180 185 190 

tea ctg get ggt ggc gac acg gcg gtg acg ate aaa gee get gee cag 624 
Ser Leu Ala Gly Gly Asp Thr Ala Val Thr lie Lys Ala Ala Ala Gin 
195 " 200 205 
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caa acc tct ggc gtt aat get gca atg get tac ggc act gac ggc ccg 672 

Gin Thr Ser Gly Val Asn Ala Ala Met Ala Tyr Gly Thr Asp Gly Pro 
210 215 220 

gta gcg gcg ctg ggg ctg caa acc tta age gat ccg caa ggt gtg caa 720 

Val Ala Ala Leu Gly Leu Gin Thr Leu Ser Asp Pro Gin Gly Val Gin 

225 230 235 240 

cct ate tac gcg cct gca cca gtg gtg cgt gag tec gtg ttg agg gag 7 68 

Pro He Tyr Ala Pro Ala Pro Val Val Arg Glu Ser Val Leu Arg Glu 
245 250 255 

tat ccg caa atg gca cag tgg eta cag cca gtc ttc gcc age etc gat 816 

Tyr Pro Gin Met Ala Gin Trp Leu Gin Pro Val Phe Ala Ser Leu Asp 

260 265 270 

gca aaa aca ttg cag caa ctg aat gcc age att get gtg gaa gga ctg 8 64 

Ala Lys Thr Leu Gin Gin Leu Asn Ala Ser He Ala Val Glu Gly Leu 
275 280 285 

gat gcc aaa aaa gtg get gcc gac tac ctg aaa caa aaa ggg tgg acg 912 

*Asp Ala Lys Lys Val Ala Ala Asp Tyr Leu Lys Gin Lys Gly Trp Thr 
290 295 300 

aag taa 918 

Lys * 

305 



<210> 224 
<211> 624 
<212> DNA 

<213> Escherichia coli 

<220> 

<221> CDS 

<222> (1) . • . (624) 

<400> 224 

atg etc gtc tat tgg ctg gat ata gtc ggc aca gcg gta ttt gcc ate 48 
Met Leu Val Tyr Trp Leu Asp He Val Gly Thr Ala Val Phe Ala He 
1 ^ 5 10 15 

tec ggc gtt ttg tta gcc gga aaa ttg cgt atg gac cct ttt ggt gtt 96 
Ser Gly Val Leu Leu Ala Gly Lys Leu Arg Met Asp Pro Phe Gly Val 
20 25 30 

ctg gta ctg ggc gtg gta acc gca gta ggc ggc ggg aca att cgc gac 144 
Leu Val Leu Gly Val Val Thr Ala Val Gly Gly Gly Thr He Arg Asp 
35 40 45 

atg gcg ctg gat cac ggc ccg gta ttt tgg gtg aaa gat ccc acc gat 192 
Met Ala Leu Asp His Gly Pro Val Phe Trp Val Lys Asp Pro Thr Asp 
50 " 55 60 

ctg gtc gtt gca atg gtc acc age atg ctg acc ate gtg ctg gtg cgc 240 
Leu Val Val Ala Met Val Thr Ser Met Leu Thr He Val Leu Val Arg 
65 70 75 80 

cag cca aga cgc tta cca aaa tgg atg ttg ccg gtg ctg gac gcc gtt 288 
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Gin Pro Arg Arg Leu Pro Lys Trp Met Leu Pro Val Leu Asp Ala Val 
85 ' 90 95 

ggt ctg gcg gtg ttt gtc ggc att ggc gtg aat aaa gcc ttt aat gcg 
Gly Leu Ala Val Phe Val Gly He Gly Val Asn Lys Ala Phe Asn Ala 
100 105 HO 

gaa gcc ggt ccg tta ate gcg gtt tgt atg ggc gtc att act ggc gtt 
Glu Ala Gly Pro Leu He Ala Val Cys Met Gly Val He Thr Gly Val 
115 120 125 



tgg cat ctt aag eta ccg acg ttt gcg ctg gat gag aat ggg cgt tga 
Trp His Leu Lys Leu Pro Thr Phe Ala Leu Asp Glu Asn Gly Arg * 
195 200 205 



<210> 225 
<211> 801 
<212> DNA 

<213> Escherichia coli 

<220> 

<221> CDS 

<222> (1) . . . (801) 

<400> 225 

atg get aag tea ctg ttc agg gcg ctg gtc gcc ctg tct ttt ctt gcg 

Met Ala Lys Ser Leu Phe Arg Ala Leu Val Ala Leu Ser Phe Leu Ala 

15 10 15 

cca ctg tgg etc aac gcc gcg ccg cgc gtc ate acg ctt tct ccc gcc 
Pro Leu Trp Leu Asn Ala Ala Pro Arg Val He Thr Leu Ser Pro Ala 
20 25 30 



336 



384 



ggc ggc ggg ate att cgt gat gtt ctg gcc cgc gaa ate ccc atg att 432 
Gly Gly Gly He He Arg Asp Val Leu Ala Arg Glu lie Pro Met He 
130 135 140 

tta cgt aca gaa ate tac gca act gcc tgt att ate ggc ggt att gtc 
Leu Arg Thr Glu He Tyr Ala Thr Ala Cys He He Gly Gly He Val 
145 " 150 155 160 



480 



cac get acg get tat tac aca ttt tec gta cca ctg gaa aca gcc agt 528 
His Ala Thr Ala Tyr Tyr Thr Phe Ser Val Pro Leu Glu Thr Ala Ser 
165 170 175 

atg atg ggc atg gtc gtg acg eta ttg att egg ctg gcg get att cgt 57 6 

Met Met Gly Met Val Val Thr Leu Leu He Arg Leu Ala Ala He Arg 
180 185 190 



624 



48 



96 



aac act gaa ctt gcc ttt gcc gcc ggg ate acg ccg gtt ggg gtc age 144 
Asn Thr Glu Leu Ala Phe Ala Ala Gly He Thr Pro Val Gly Val Ser 
35 40 45 

age tat tec gac tat cct cca caa gcg caa aag att gag cag gtt tec 192 
Ser Tyr Ser Asp Tyr Pro Pro Gin Ala Gin Lys He Glu Gin Val Ser 
50 55 60 

acc tgg cag ggg atg aat ctg gaa cgc att gtc gcg ctg aaa ccc gat 240 
Thr Trp Gin Gly Met Asn Leu Glu Arg He Val Ala Leu Lys Pro Asp 
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65 70 75 80 

ctg gtg att gcc tgg cgt gga ggt aat gcc gag egg cag gtt gac cag 288 
Leu Val lie Ala Trp Arg Gly Gly Asn Ala Glu Arg Gin Val Asp Gin 
85 90 95 

ctg get teg ctg gga ata aaa gtg atg tgg gtc gat gcg aca age att 336 
Leu Ala Ser Leu Gly lie Lys Val Met Trp Val Asp Ala Thr Ser lie 
100 105 110 

gaa caa att gcc aat gcg tta cgt caa ctg gcc ccc tgg agt ccg caa 384 
Glu Gin lie Ala Asn Ala Leu Arg Gin Leu Ala Pro Trp Ser Pro Gin 
115 120 125 

cca gac aag gcc gaa caa gcc gcg caa tec ctg ctg gat cag tac gcg 432 
Pro Asp Lys Ala Glu Gin Ala Ala Gin Ser Leu Leu Asp Gin Tyr Ala 
130 135 140 

caa ttg aaa gcg caa tat get gat aaa cct aaa aaa cgt gtt ttt ctg 480 
Gin Leu Lys Ala Gin Tyr Ala Asp Lys Pro Lys Lys Arg Val Phe Leu 
145 150 155 160 

caa ttc ggc att aat ccg cca ttt acc agt gga aaa gag teg att cag 528 
Gin Phe Gly lie Asn Pro Pro Phe Thr Ser Gly Lys Glu Ser lie Gin 
165 170 175 

aac cag gta etc gaa gtt tgt ggc gga gaa aac ate ttt aaa gac age 57 6 

Asn Gin Val Leu Glu Val Cys Gly Gly Glu Asn lie Phe Lys Asp Ser 
180 185 190 

egg gtt ccc tgg ccg caa gtt age cgc gaa cag gtg tta gca cgc teg 624 
Arg Val Pro Trp Pro Gin Val Ser Arg Glu Gin Val Leu Ala Arg Ser 
195 200 205 

cca cag gcg att gtc att aca ggc gga ccg gac caa att cct aaa ate 672 
Pro Gin Ala lie Val He Thr Gly Gly Pro Asp Gin He Pro Lys He 
210 215 220 

aaa caa tac tgg ggt gaa cag etc aaa att ccc gtt att cct etc acg 720 
Lys Gin Tyr Trp Gly Glu Gin Leu Lys He Pro Val He Pro Leu Thr 
225 230 235 240 

agt gac tgg ttt gaa cgt gca age cca cgt att ate etc get gca caa 768 
Ser Asp Trp Phe Glu Arg Ala Ser Pro Arg He He Leu Ala Ala Gin 
245 250 255 

cag etc tgt aat gcg ctt tea cag gta gat tag 801 
Gin Leu Cys Asn Ala Leu Ser Gin Val Asp * 
260 265 



<210> 226 
<211> 699 
<212> DNA 

<213> Escherichia coli 

<220> 

<221> CDS 

<222> (1) . . . (699) 
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<400> 226 

atg aaa ate ggc ate att ggt gca atg gaa gaa gaa gtt acg ctg ctg 48 
Met Lys He Gly lie He Gly Ala Met Glu Glu Glu Val Thr Leu Leu 
1 5 10 15 

cgt gac aaa ate gaa aac cgt caa act ate agt etc ggc ggt tgc gaa 96 
Arg Asp Lys He Glu Asn Arg Gin Thr He Ser Leu Gly Gly Cys Glu 
20 25 30 

ate tat ace ggc caa ctg aat gga ace gag gtt gcg ctt ctg aaa teg 144 
He Tyr Thr Gly Gin Leu Asn Gly Thr Glu Val Ala Leu Leu Lys Ser 
35 40 45 

ggc ate ggt aaa gtc get gcg gcg ctg ggt gee act ttg ctg ttg gaa 192 
Gly He Gly Lys Val Ala Ala Ala Leu Gly Ala Thr Leu Leu Leu Glu 
50 55 60 

cac tgc aag cca gat gtg att att aac ace ggt tct gee ggt ggc ctg 240 
His Cys Lys Pro Asp Val He He Asn Thr Gly Ser Ala Gly Gly Leu 
65 70 75 80 

gca cca acg ttg aaa gtg ggc gat ate gtt gtc teg gac gaa gca cgt 288 
Ala Pro Thr Leu Lys Val Gly Asp He Val Val Ser Asp Glu Ala Arg 
85 90 95 

tat cac gac gcg gat gtc acg gca ttt ggt tat gaa tac ggt cag tta 336 
Tyr His Asp Ala Asp Val Thr Ala Phe Gly Tyr Glu Tyr Gly Gin Leu 
100 105 HO 

cca ggc tgt ccg gca ggc ttt aaa get gac gat aaa ctg ate get gee 
Pro Gly Cys Pro Ala Gly Phe Lys Ala Asp Asp Lys Leu He Ala Ala 
115 120 125 



384 



get gag gee tgc att gee gaa ctg aat ctt aac get gta cgt ggc ctg' 4 32 

Ala Glu Ala Cys He Ala Glu Leu Asn Leu Asn Ala Val Arg Gly Leu 
130 135 140 

att gtt age ggc gac get ttc ate aac ggt tct gtt ggt ctg gcg aaa 4 80 

He Val Ser Gly Asp Ala Phe He Asn Gly Ser Val Gly Leu Ala Lys 
145 ™ " 150 155 160 

ate cgc cac aac ttc cca cag gec att get gta gag atg gaa gcg acg 528 
He Arg His Asn Phe Pro Gin Ala He Ala Val Glu Met Glu Ala Thr 
165 170 175 • 

gca ate gee cat gtc tgc cac aat ttc aac gtc ccg ttt gtt gtc gta 
Ala He Ala His Val Cys His Asn Phe Asn Val Pro Phe Val Val Val 
180 185 190 

cgc gee ate tec gac gtg gee gat caa cag tct cat ctt age ttc gat 624 
Arg Ala He Ser Asp Val Ala Asp Gin Gin Ser His Leu Ser Phe Asp 
195 200 205 

gag ttc ctg get gtt gee get aaa cag tec age ctg atg gtt gag tea 672 
Glu Phe Leu Ala Val Ala Ala Lys Gin Ser Ser Leu Met Val Glu Ser • 
210 215 220 



576 



ctg gtg cag aaa ctt gca cat ggc taa 
Leu Val Gin Lys Leu Ala His Gly * 
225 230 



699 
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<210> 227 
<211> 789 
<212> DNA 

<213> Escherichia coli 

<220> 

<221> CDS 

<222> (1)...(789) 

<400> 227 

gtg att gat aaa tec gec ttt gtg cat cca acc gec att gtg gaa gag 48 

Met lie Asp Lys Ser Ala Phe Val His Pro Thr Ala lie Val Glu Glu 

1 * 5 10 15 

ggc gcg tea att ggc gcg aac gca cac att ggt cct ttt tgt ate gtt 96 
Gly Ala Ser lie Gly Ala Asn Ala His lie Gly Pro Phe Cys lie Val 
20 25 30 

gga ccc cat gtc gaa att ggt gag ggt acc gta ctg aaa tct cac gtt 144 
Gly Pro His Val Glu He Gly Glu Gly Thr Val Leu Lys Ser His Val 
35 40 45 

gtc gtg aat ggt cat act aaa att ggc cgc gat aat gag att tat cag 192 
Val Val Asn Gly His Thr Lys He Gly Arg Asp Asn Glu He Tyr Gin 
50 55 60 

ttc gec tec ate ggc gaa gtt aac cag gat ctg aaa tat get ggc gaa 240 
Phe Ala Ser He Gly Glu Val Asn Gin Asp Leu Lys Tyr Ala Gly Glu 
65 70 75 80 

ccg acc cgt gtg gaa ate ggc gat cgt aac cgc att cgc gaa age gtc 288 
Pro Thr Arg Val Glu lie Gly Asp Arg Asn Arg He Arg Glu Ser Val 
85 90 95 

acc att cat cgt ggc aca gtc cag ggc ggt gga ttg acg aag gtg ggc 336 
Thr He His Arg Gly Thr Val Gin Gly Gly Gly Leu Thr Lys Val Gly 
100 105 110 

age gac aac tta ctg atg ate aac gcg cac att gcg cac gat tgt acg 384 
Ser Asp Asn Leu Leu Met He Asn Ala His He Ala His Asp Cys Thr 
115 120 125 

gta ggt aac cgc tgt att etc gee aac aac gca acg ctg gcg ggt cac 432 
Val Gly Asn Arg Cys He Leu Ala Asn Asn Ala Thr Leu Ala Gly His 
130 135 140 

gta teg gtt gac gac ttc gcg ate ate ggc ggc atg acc gca gtc cat 480 
Val Ser Val Asp Asp Phe Ala He He Gly Gly Met Thr Ala Val His 
145 150 155 160 

cag ttc tgc ate att ggt gcg cac gtg atg gtt ggc ggc tgc tec ggt 528 
Gin Phe Cys He He Gly Ala His Val Met Val Gly Gly Cys Ser Gly 
165 170 175 

gtg gcg cag gac gtc cct cct tat gtc att gcg cag ggt aac cac gca 57 6 

Val Ala Gin Asp Val Pro Pro Tyr Val lie Ala Gin Gly Asn His Ala 
180 185 190 
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acg ccg ttc ggt gtc aat ate gaa ggg ctg aag cgc cgc gga ttc age 
Thr Pro Phe Gly Val Asn lie Glu Gly Leu Lys Arg Arg Gly Phe Ser 
195 200 205 

cgt gag gcg att ace get ate cgc aat gcg tat aag ctg att tat cgt 
Arg Glu Ala lie Thr Ala lie Arg Asn Ala Tyr Lys Leu He Tyr Arg 
210 215 220 

age ggt aaa acg etc gat gaa gtg aaa ccg gaa att get gaa ctg gcg 
Ser Gly Lys Thr Leu Asp Glu Val Lys Pro Glu He Ala Glu Leu Ala 
225 230 235 240 

gaa aca tat ccg gaa gtg aaa gee ttt ace gat ttc ttt gca cgc tea 
Glu Thr Tyr Pro Glu Val Lys Ala Phe Thr Asp Phe Phe Ala Arg Ser 
245 250 255 

acg cgc ggt ctg att cgt taa 
Thr Arg Gly Leu He Arg * 
260 



<210> 228 
<211> 1149 
<212> DNA 

<213> Escherichia coli 

<220> 

<221> CDS 

<222> (1)...(1149) 

<400> 228 

atg act gaa cag cgt cca tta acg att gec ctg gtc gec gga gaa acc 
Met Thr Glu Gin Arg Pro Leu Thr He Ala Leu Val Ala Gly Glu Thr 
15 10 15 

tec ggc gat ate ctg ggg gec ggt tta ate cgc get ctg aaa gaa cat 
Ser Gly Asp He Leu Gly Ala Gly Leu He Arg Ala Leu Lys Glu His 
20 25 30 



624 



672 



720 



768 



789 



48 



96 



gtg ccc aac gec cgc ttt gtt ggt gtt gee ggg cca cga atg cag get 144 
Val Pro Asn Ala Arg Phe Val Gly Val Ala Gly Pro Arg Met Gin Ala 
35 40 45 

gaa ggc tgc gaa gee tgg tac gaa atg gaa gaa ctg gcg gtg atg ggc 192 
Glu Gly Cys Glu Ala Trp Tyr Glu Met Glu Glu Leu Ala Val Met Gly 
50 55 60 

att gtt gaa gtg etc ggt cgt ctg cgt cgc tta ctg cat att cgt gee 
He Val Glu Val Leu Gly Arg Leu Arg Arg Leu Leu His He Arg Ala 
65 70 75 80 

gat ctg aca aag cgt ttt ggc gaa ctg aag cca gat gtt ttt gtt ggt 
Asp Leu Thr Lys Arg Phe Gly Glu Leu Lys Pro Asp Val Phe Val Gly 
85 90 95 

att gat gcg cct gac ttc aat att act ctt gaa ggt aac etc aaa aag 
He Asp Ala Pro Asp Phe Asn lie Thr Leu Glu Gly Asn Leu Lys Lys 
100 .105 HO 

cag ggt ate aaa acc att cat tac gtc agt ccg tea gtc tgg gcg tgg 



240 



288 



336 



384 
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Gin Gly lie Lys Thr lie His Tyr Val Ser Pro Ser Val Trp Ala Trp 
115 120 125 

cga cag aaa cgt gtt ttc aaa ata ggc aga gcc acc gat ctg gtg etc 
Arg Gin Lys Arg Val Phe Lys lie Gly Arg Ala Thr Asp Leu Val Leu 
130 135 140 

gca ttt ctg cct ttc gaa aaa gcg ttt tat gac aaa tac aac gta ccg 
Ala Phe Leu Pro Phe Glu Lys Ala Phe Tyr Asp Lys Tyr Asn Val Pro 
145 150 155 160 

tgc cgc ttt ate ggt cat acc atg get gat gcc atg cca tta gat cca 
Cys Arg Phe lie Gly His Thr Met Ala Asp Ala Met Pro Leu Asp Pro 
165 170 175 

gat aaa aat gcc gcc cgt gat gtg ctg ggg ate cct cac gat gcc cac 
Asp Lys Asn Ala Ala Arg Asp Val Leu Gly lie Pro His Asp Ala His 
180 185 190 

tgc ctg gcg ttg eta ccg ggg age cgt ggt gca gaa gtt gaa atg ctt 
Cys Leu Ala Leu Leu Pro Gly Ser Arg Gly Ala Glu Val Glu Met Leu 
195 200 205 

agt gcc gat ttc ctg aaa acg gcc cag ctt ttg cgc cag aca tat ccg 
Ser Ala Asp Phe Leu Lys Thr Ala Gin Leu Leu Arg Gin Thr Tyr Pro 
210 215 220 

gat etc gaa ate gtg gtg cca ctg gtg aat gcc aaa cgc cgc gag cag 
Asp Leu Glu lie Val Val Pro Leu Val Asn Ala Lys Arg Arg Glu Gin 
225 230 235 240 



432 



480 



528 



576 



624 



672 



720 



864 



ttt gaa cgc ate aaa get gaa gtc gcg cca gac ctt tea gtt cat ttg 768 
Phe Glu Arg He Lys Ala Glu Val Ala Pro Asp Leu Ser Val His Leu 
245 250 255 

ctg gat ggg atg ggc cgt gag gcg atg gtc gcc age gat gcg gcg eta 816 
Leu Asp Gly Met Gly Arg Glu Ala Met Val Ala Ser Asp Ala Ala Leu 
260 265 270 

ctg gcg teg ggt acg gca gcc ctg gag tgt atg ctg gcg aaa tgc ccg 
Leu Ala Ser Gly Thr Ala Ala Leu Glu Cys Met Leu Ala Lys Cys Pro 
275 280 285 

atg gtg gtg gga tat cgc atg aag cct ttt acc ttc tgg ttg gcg aag 912 
Met Val Val Gly Tyr Arg Met Lys Pro Phe Thr Phe Trp Leu Ala Lys 
290 295 300 

egg ctg gtg aaa act gat tat gtc teg ctg cca aat ctg ctg gcg ggc 960 
Arg Leu Val Lys Thr Asp Tyr Val Ser Leu Pro Asn Leu Leu Ala Gly 
305 310 315 320 

aga gag tta gtc aaa gaa tta ttg cag gaa gag tgt gag ccg caa aaa 1008 
Arg Glu Leu Val Lys Glu Leu Leu Gin Glu Glu Cys Glu Pro Gin Lys 
325 330 335 



ctg get gcg gcg ctg tta ccg ctg ttg gcg aac ggg aaa acc age cac 

Leu Ala Ala Ala Leu Leu Pro Leu Leu Ala Asn Gly Lys Thr Ser His 

340 345 350 

gcg atg cac gat acc ttc cgt gaa ctg cat cag cag ate cgc tgc aat 



1056 



1104 
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Ala Met His Asp Thr Phe Arg Glu Leu His Gin Gin lie Arg Cys Asn 
355 360 365 

gcc gat gag cag gcg gca caa gcc gtt ctg gag tta gca caa tga 1149 
Ala Asp Glu Gin Ala Ala Gin Ala Val Leu Glu Leu Ala Gin * 
370 375 380 



<210> 229 
<211> 597 
<212> DNA 

<213> Escherichia coli 

<220> 

<221> CDS 

<222> (1) . . . (597) 

<400> 229 

atg ate gaa ttt gtt tat ccg cac acg cag ctg gtt gcg ggt gtg gat 48 
Met He Glu Phe Val Tyr Pro His Thr Gin Leu Val Ala Gly Val Asp 
1 5 10 15 

gaa gtc gga cgc ggg ccg tta gtt ggc gcg gtc gtc acc get gcg gtg 96 
Glu Val Gly Arg Gly Pro Leu Val Gly Ala Val Val Thr Ala Ala Val 
20 25 30 

ate ctt gac ccg gcg cgc ccg att gcc ggg ctg aat gat tec aaa aag 144 
He Leu Asp Pro Ala Arg Pro lie Ala Gly Leu Asn Asp Ser Lys Lys 
35 40 45 

ctg age gaa aaa cgc cgt ctg gcg etc tat gaa gag ate aaa gag aaa 192 
Leu Ser Glu Lys Arg Arg Leu Ala Leu Tyr Glu Glu He Lys Glu Lys 
50 55 60 

gcg ttg age tgg agt ctg ggc cgc gcg gaa ccc cac gaa ate gac gag 240 
Ala Leu Ser Trp Ser Leu Gly Arg Ala Glu Pro His Glu He Asp Glu 
65 70 75 80 

ctg aac att ctt cat gcg acc atg ctg gcg atg cag cgt gcc gtc get 288 
Leu Asn He Leu His Ala Thr Met Leu Ala Met Gin Arg Ala Val Ala 
85 90 95 

ggg ctg cat att gcg ccg gaa tat gtg ttg att gat ggt aac cgc tgc 336 
Gly Leu His He Ala Pro Glu Tyr Val Leu He Asp Gly Asn Arg Cys 
100 105 110 

ccg aaa tta ccg atg cct gcg atg get gtg gtg aaa ggc gat age cgc 384 
Pro Lys Leu Pro Met Pro Ala Met Ala Val Val Lys Gly Asp Ser Arg 
115 120 125 

gta ccg gaa ate agt gcc gcg tct ate ctg gcg aaa gtg acg cgt gac 4 32 

Val Pro Glu lie Ser Ala Ala Ser He Leu Ala Lys Val Thr Arg Asp 
130 135 140 

gcc gaa atg gcg gcg ctg gat att gtt ttc ccg caa tat ggt ttt gcc 480 
Ala Glu Met Ala Ala Leu Asp He Val Phe Pro Gin Tyr Gly Phe Ala 
145 150 155 160 

caa cac aaa ggg tac cca acc get ttt cat ctg gaa aaa ctg get gaa 528 
Gin His Lys Gly Tyr Pro Thr Ala Phe His Leu Glu Lys Leu Ala Glu 
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165 

cac ggc gcg acc gaa cac cat egg cgc 
His Gly Ala Thr Glu His His Arg Arg 
180 185 

gca ctg gga ctt gcg tec tga 
Ala Leu Gly Leu Ala Ser + 
195 



PCT/USOO/30950 

170 175 

age ttt ggg cct gtc aaa cgc 576 
Ser Phe Gly Pro Val Lys Arg 
190 

5 97 



<210> 230 
<211> 3483 
<2L2> DNA 

<213> Escherichia coli 

<220> 

<221> CDS 

<222> (1) . . • (3483) 

<400> 230 

atg tct gaa cca cgt ttc gta cac ctg egg gtg cac age gac tac teg 48 
Met Ser Glu Pro Arg Phe Val His Leu Arg Val His Ser Asp Tyr Ser 
15 10 15 

atg ate gat ggc ctg gee aaa acc gca ccg ttg gta aaa aag gcg gcg 96 
Met lie Asp Gly Leu Ala Lys Thr Ala Pro Leu Val Lys Lys Ala Ala 
20 25 30 

gcg ttg ggt atg cca gca ctg gcg ate acc gat ttc acc aac ctt tgt 144 
Ala Leu Gly Met Pro Ala Leu Ala lie Thr Asp Phe Thr Asn Leu Cys 
35 40 45 

ggt ctg gtg aag ttc tac gga gcg gga cat ggc gca ggg att aag cct 192 
Gly Leu Val Lys Phe Tyr Gly Ala Gly His Gly Ala Gly lie Lys Pro 
50 55 60 

ate gtc ggg gca gat ttt aac gtc cag tgc gac ctg ctg ggt gat gag 240 
lie Val Gly Ala Asp Phe Asn Val Gin Cys Asp Leu Leu Gly Asp Glu 
65 ' 70 75 80 

tta acc cac ctg acg gta ctg gcg gcg aac aat acc ggc tat cag aat 288 
Leu Thr His Leu Thr Val Leu Ala Ala Asn Asn Thr Gly Tyr Gin Asn 
85 90 95 

ctg acg ttg ctg ate tea aaa gcg tat cag cgc ggg tac ggt gec gee 336 
Leu Thr Leu Leu He Ser Lys Ala Tyr Gin Arg Gly Tyr Gly Ala Ala 
100 105 110 

ggg ccg ate ate gat cgc gac tgg ctt ate gaa tta aac gaa ggg ttg 384 
Gly Pro He He Asp Arg Asp Trp Leu He Glu Leu Asn Glu Gly Leu 
115 120 125 

ate ctt ctt tec ggc gga cgc atg ggc gac gtc gga cgc agt ctt ttg 432 
He Leu Leu Ser Gly Gly Arg Met Gly Asp Val Gly Arg Ser Leu Leu 
130 135 140 

cgt ggt aac age gcg ctg gta gat gag tgt gtc gcg ttt tat gaa gaa 480 
Arg Gly Asn Ser Ala Leu Val Asp Glu Cys Val Ala Phe Tyr Glu Glu 
145 150 155 160 
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cac ttc ccg gat cgc tat ttt etc gag ctg ate cgc ace ggc agg ccg 528 
His Phe Pro Asp Arg Tyr Phe Leu Glu Leu lie Arg Thr Gly Arg Pro 
165 170 175 

gat gaa gaa age tat ctg cac gcg gcg gtg gaa ctg gcg gaa gcg cgc 576 
Asp Glu Glu Ser Tyr Leu His Ala Ala Val Glu Leu Ala Glu Ala Arg 
180 185 190 

ggt ttg ccc gtc gtg gcg ace aac gac gtg cgc ttt ate gac age age 624 
Gly Leu Pro Val Val Ala Thr Asn Asp Val Arg Phe lie Asp Ser Ser 
195 200 205 

gac ttt gac gca cac gaa ate cgc gtc gcg ate cac gac ggc ttt ace 672 
Asp Phe Asp Ala His Glu lie Arg Val Ala He His Asp Gly Phe Thr 
210 * 215 220 

etc gac gat cct aaa cgc ccg cgt aac tat teg ccg cag caa tat atg 720 
Leu Asp Asp Pro Lys Arg Pro Arg Asn Tyr Ser Pro Gin Gin Tyr Met 
225 230 235 240 

cgt age gaa gag gag atg tgt gag ctg ttt gee gac ate ccc gaa gee 768 
Arg Ser Glu Glu Glu Met Cys Glu Leu Phe Ala Asp He Pro Glu Ala 
245 250 -255 

ctt gee aac ace gtt gag ate gee aaa cgc tgt aac gta acc gtg cgt 816 
Leu Ala Asn Thr Val Glu He Ala Lys Arg Cys Asn Val Thr Val Arg 
260 265 270 

ctt ggt gaa tac ttc ctg ccg cag ttc ccg acc ggg gac atg age acc 8 64 

Leu Gly Glu Tyr Phe Leu Pro Gin Phe Pro Thr Gly Asp Met Ser Thr 
275 280 285 

gaa gat tat ctg gtc aag cgt gca aaa gag ggc ctg gaa gag cgt ctg 912 
Glu Asp Tyr Leu Val Lys Arg Ala Lys Glu Gly Leu Glu Glu Arg Leu 
290 295 300 

gee ttt tta ttc cct gat gag gaa gaa cgt ctt aag cgc cgc ccg gaa 960 
Ala Phe Leu Phe Pro Asp Glu Glu Glu Arg Leu Lys Arg Arg Pro Glu 
305 310 315 320 

tat gac gaa cgt ctg gag act gaa ctt cag gtt ate aac cag atg ggc 1008 
Tyr Asp Glu Arg Leu Glu Thr Glu Leu Gin Val He Asn Gin Met Gly 
325 330 335 

ttc ccg ggc tac ttc etc ate gtt atg gaa ttt ate cag tgg teg aaa 1056 
Phe Pro Gly Tyr Phe Leu He Val Met Glu Phe He Gin Trp Ser Lys 
340 . 345 350 

gat aac ggc gta ccg gta ggg cca ggc cgt ggc tec ggt gcg ggt tea 1104 
Asp Asn Gly Val Pro Val Gly Pro Gly Arg Gly Ser Gly Ala Gly Ser 
355 360 365 

ctg gtg gee tac gcg ctg aaa ate acc gac etc gat ccg ctg gaa ttt 1152 
Leu Val Ala Tyr Ala Leu Lys He Thr Asp Leu Asp Pro Leu Glu Phe 
370 375 380 

gac ctg ctg ttc gaa cgt ttc ctt aac ccg gaa cgt gtc tec atg cct 1200 
Asp Leu Leu Phe Glu Arg Phe Leu Asn Pro Glu Arg Val Ser Met Pro 
385 390 395 400 
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gac ttc gac gtt gac ttc tgt atg gag aaa cgc gat cag gtt ate gag 1248 

Asp Phe Asp Val Asp Phe Cys Met Glu Lys Arg Asp Gin Val lie Glu 

405 410 415 

cac gta gcg gac atg tac ggt cgt gat gcg gta teg cag ate ate acc 1296 

His Val Ala Asp Met Tyr Gly Arg Asp Ala Val Ser Gin lie lie Thr 

420 425 430 

ttc ggt aca atg gcg gcg aaa gcg gtg ate cgc gac gta ggc cgc gtg 1344 

Phe Gly Thr Met Ala Ala Lys Ala Val He Arg Asp Val Gly Arg Val 
435 440 445 

ctg ggg cat ccg tac ggc ttt gtc gat cgt ate teg aaa ctg ate ccg 1392 

Leu Gly His Pro Tyr Gly Phe Val Asp Arg He Ser Lys Leu He Pro 
450 455 460 

ccc gat ccg ggg atg acg ctg gcg aaa gcg ttt gaa gec gag ccg cag 14 40 

Pro Asp Pro Gly Met Thr Leu Ala Lys Ala Phe Glu Ala Glu Pro Gin 

465 470 475 480 

ctg ccg gaa ate tac gaa gcg gat gaa gaa gtt aag gcg ctg ate gac 1488 

Leu Pro Glu lie Tyr Glu Ala Asp Glu Glu Val Lys Ala Leu He Asp 

485 490 495 

atg gcg cgc aaa ctg gaa ggg gtc acc cgt aac gec ggt aag cac gec 1536 

Met Ala Arg Lys Leu Glu Gly Val Thr Arg Asn Ala Gly Lys His Ala 

500 505 510 

ggt ggg gtg gtt ate gcg ccg acc aaa att acc gat ttt gcg ccg ctt 1584 
Gly Gly Val Val He Ala Pro Thr Lys He Thr Asp Phe Ala Pro Leu 
515 520 525 

tac tgc gat gaa gag ggc aaa cat ccg gtc acc cag ttt gat aaa age 1632 

Tyr Cys Asp Glu Glu Gly Lys His Pro Val Thr Gin Phe Asp Lys Ser 
530 535 540 



gac gtt gaa tac gee gga ctg gtg aag ttc gac ttc ctt ggt ttg cgt 
Asp Val Glu Tyr Ala Gly Leu Val Lys Phe Asp Phe Leu Gly Leu Arg 
545 550 555 560 



1680 



acg etc acc ate ate aac tgg gcg ctg gag atg ate aac aag egg egg 1728 

Thr Leu Thr He He Asn Trp Ala Leu Glu Met He Asn Lys Arg Arg 

565 570 575 

gcg aag aat ggc gag ccg ccg ctg gat ate get gcg ate ccg ctg gat 1776 

Ala Lys Asn Gly Glu Pro Pro Leu Asp He Ala Ala He Pro Leu Asp 

580 585 590 

gat aag aaa age ttc gac atg ctg caa cgc teg gaa acc acg gcg gta 1824 

Asp Lys Lys Ser Phe Asp Met Leu Gin Arg Ser Glu Thr Thr Ala Val 

595 600 605 

ttc cag ctt gaa teg cgc ggc atg aag gac ctg ate aag cgt eta caa 1872 

Phe Gin Leu Glu Ser Arg Gly Met Lys Asp Leu He Lys Arg Leu Gin 
610 615 620 

cct gac tgc ttc gaa gat atg ate gec eta gtg gca ctg ttc cgc ccc 1920 

Pro Asp Cys Phe Glu Asp Met He Ala Leu Val Ala Leu Phe Arg Pro 
625 630 635 640 
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ggt ccg ttg caa tea ggg atg gtg gat aac ttt ate gac cgt aaa cat 1968 
Gly Pro Leu Gin Ser Gly Met Val Asp Asn Phe lie Asp Arg Lys His 
64 5 650 655 

ggt cgt gaa gag ate tec tat ccg gac gta cag tgg cag cat gaa age 2016 
Gly Arg Glu Glu lie Ser Tyr Pro Asp Val Gin Trp Gin His Glu Ser 
660 665 670 

ctg aaa ccg gta ctg gag cca ace tac ggc att ate ctg tat cag gaa 2064 
Leu Lys Pro Val Leu Glu Pro Thr Tyr Gly lie lie Leu Tyr Gin Glu 
675 680 685 

cag gtc atg cag att gcg cag gtg ctt tct ggt tat ace etc ggt ggc 2112 
Gin Val Met Gin lie Ala Gin Val Leu Ser Gly Tyr Thr Leu Gly Gly 
690 695 700 

gcg gat atg ctg cgt cgt gcg atg ggt aag aaa aag ccg gaa gag atg 2160 
Ala Asp Met Leu Arg Arg Ala Met Gly Lys Lys Lys Pro Glu Glu Met 
705 710 715 720 

get aag caa cgt tct gta ttt get gaa ggt gca gaa aag aac gga ate 2208 
Ala Lys Gin Arg Ser Val Phe Ala Glu Gly Ala Glu Lys Asn Gly lie 
725 730 735 

aac get gaa ctg gcg atg aaa ate ttc gac ctg gtg gag aaa ttc get 2256 
Asn Ala Glu Leu Ala Met Lys lie Phe Asp Leu Val Glu Lys Phe Ala 
740 745 750 

ggt tac gga ttt aac aaa teg cac tct gcg gec tat get ttg gtg tea 2304 
Gly Tyr Gly Phe Asn Lys Ser His Ser Ala Ala Tyr Ala Leu Val Ser 
755 760 765 

tat caa acg tta tgg ctg aaa gcg cac tat cct gcg gag ttt atg gcg 2352 
Tyr Gin Thr Leu Trp Leu Lys Ala His Tyr Pro Ala Glu Phe Met Ala 
770 775 * 780 

gcg gta atg ace gee gat atg gac aac ace gag aag gtg gtg ggt ctg 2400 
Ala Val Met Thr Ala Asp Met Asp Asn Thr Glu Lys Val Val Gly Leu . 
785 790 795 800 

gtg gat gag tgc tgg egg atg ggg ctg aaa ate ctg cca cca gat ata 244 8 

Val Asp Glu Cys Trp Arg Met Gly Leu Lys lie Leu Pro Pro Asp He 
805 810 815 

aac tec ggt ctt tac cat ttc cac gtc aac gac gac ggc gaa ate gtg 24 96 
Asn Ser Gly Leu Tyr His Phe His Val Asn Asp Asp Gly Glu lie Val 
820 825 830 

tat ggt att ggc gcg ate aaa ggg gtc ggt gaa ggt ccg att gag gee 2544 
Tyr Gly He Gly Ala He Lys Gly Val Gly Glu Gly Pro He Glu Ala 
835 840 845 

ate ate gaa gee cgt aat aaa ggc ggc tac ttc cgc gaa ctg ttt gat 2592 
He He Glu Ala Arg Asn Lys Gly Gly Tyr Phe Arg Glu Leu Phe Asp 
850 855 860 

etc tgc gee cgt ace gac ace aaa aag ttg aac cgt cgc gtg ctg gaa 2640 
Leu Cys Ala Arg Thr Asp Thr Lys Lys Leu Asn Arg Arg Val Leu Glu 
865 870 875 880 
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aaa ctg ate atg tec ggg gcg ttt gac cgt ctt ggg cca cat cgc gca 2688 

Lys Leu lie Met Ser Gly Ala Phe Asp Arg Leu Gly Pro His Arg Ala 

885 890 895 

gcg ctg atg aac teg ctg ggc gat gcg tta aaa gcg gca gat caa cac 2736 

Ala Leu Met Asn Ser Leu Gly Asp Ala Leu Lys Ala Ala Asp Gin His 
900 905 910 

gcg aaa gcg gaa get ate ggt cag gec gat atg ttc ggc gtg ctg gec 2784 

Ala Lys Ala Glu Ala lie Gly Gin Ala Asp Met Phe Gly Val Leu Ala 
915 920 925 

gaa gag ccg gaa caa att gaa caa tec tac gec age tgc caa ccg tgg 2832 

Glu Glu Pro Glu Gin lie Glu Gin Ser Tyr Ala Ser Cys Gin Pro Trp 
930 935 940 

ccg gag cag gtg gta tta gat ggg gaa cgt gaa acg tta ggc ctg tac 2880 

Pro Glu Gin Val Val Leu Asp Gly Glu Arg Glu Thr Leu Gly Leu Tyr 
945 950 955 960 

ctg acc gga cac cct ate aac cag tat tta aaa gag att gag cgt tat 2928 

Leu Thr Gly His Pro He Asn Gin Tyr Leu Lys Glu He Glu Arg Tyr 

965 970 975 

gtc gga ggc gta agg ctg aaa gac atg cac ccg aca gaa cgt ggt aaa 297 6 

Val Gly Gly Val Arg Leu Lys Asp Met His Pro Thr Glu Arg Gly Lys 
980 985 990 

gtc ate acg get gcg ggg etc gtt gtt gee gcg egg gtt atg gtc acc 3024 

Val He Thr Ala Ala Gly Leu Val Val Ala Ala Arg Val Met Val Thr 
995 " 1000 1005 

aag cgc ggc aat cgt ate ggt ate tgc acg ctg gat gac cgt tec ggg 3072 

Lys Arg Gly Asn Arg He Gly He Cys Thr Leu Asp Asp Arg Ser Gly 
1010 1015 1020 

egg ctg gaa gtg atg ttg ttt act gac gee ctg gat aaa tac cag caa 3120 

Arg Leu Glu Val Met Leu Phe Thr Asp Ala Leu Asp Lys Tyr Gin Gin 
1025 1030 1035 1040 



ttg ctg gaa aaa gac cgc ata ctt ate gtc age gga cag gtc age ttt 
Leu Leu Glu Lys Asp Arg He Leu He Val Ser Gly Gin Val Ser Phe 
1045 1050 1055 



3168 



gat gac ttc age ggt ggg ctt aaa atg acc get cgc gaa gtg atg gat 3216 

Asp Asp Phe Ser Gly Gly Leu Lys Met Thr Ala Arg Glu Val Met Asp 

1060 1065 1070 

att gac gaa gee egg gaa aaa tat get cgc ggg ctt get ate teg ctg 3264 

He Asp Glu Ala Arg Glu Lys Tyr Ala Arg Gly Leu Ala He Ser Leu 

1075 " 1080 1085 

acg gac agg caa att gat gac cag ctt tta aac cga etc cgt cag tct 3312 

Thr Asp Arg Gin He Asp Asp Gin Leu Leu Asn Arg Leu Arg Gin Ser 

1090 1095 1100 

ctg gaa ccc cac cgc tct ggg aca att cca gta cat etc tac tat cag 3360 

Leu Glu Pro His Arg Ser Gly Thr lie Pro Val His Leu Tyr Tyr Gin 

1105 1110 1115 1120 
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agg gcg gat gca cgc gcg egg ttg cgt ttt ggc gcg acg tgg cgt gtc 3408 
Arg Ala Asp Ala Arg Ala Arg Leu Arg Phe Gly Ala Thr Trp Arg Val 
1125 1130 1135 

tct ccg age gat cgt tta tta aac gat etc cgt ggc etc att ggt teg 34 56 
Ser Pro Ser Asp Arg Leu Leu Asn Asp Leu Arg Gly Leu He Gly Ser 
1140 1145 1150 

gag cag gtg gaa ctg gag ttt gac taa 3483 
Glu Gin Val Glu Leu Glu Phe Asp * 
1155 H60 



<210> 231 
<211> 720 
<212> DNA 

<213> Escherichia coli 

<220> 

<221> CDS 

<222> {1} . . . (720) 

<400> 231 

gtg aat gtc aat ttc ttt gtc acc tgt att ggt gac gec ctg aaa tea 48 
Met Asn Val Asn Phe Phe Val Thr Cys He Gly Asp Ala Leu Lys Ser 



15 10 15 

aga atg gca cga gac tec gtg ctg eta ctg gaa aaa etc ggc tgt cgc 
Arg Met Ala Arg Asp Ser Val Leu Leu Leu Glu Lys Leu Gly Cys Arg 
20 25 30 



tgg gca tea cgt gec gca aag gtt gee gcg cgt atg cag gat etc acc 
Trp Ala Ser Arg Ala Ala Lys Val Ala Ala Arg Met Gin Asp Leu Thr 
100 105 110 



96 



gta aat ttc ccg gag aaa cag gga tgc tgc ggt cag cct gcg ate aat 144 
Val Asn Phe Pro Glu Lys Gin Gly Cys Cys Gly Gin Pro Ala He Asn 
35 40 45 

age ggt tat ate aaa gaa gcg att cca ggg atg aaa aat ctg ate gee 
Ser Gly Tyr He Lys Glu Ala He Pro Gly Met Lys Asn Leu He Ala 
50 55 60 



192 



gca ctg gag gat aac gac gat ccc att att tea ccg get ggc tct tgc 240 
Ala Leu Glu Asp Asn Asp Asp Pro Ho He Ser Pro Ala Gly. Ser Cys 
65 70 75 80 

acc tat gee gta aaa agt tac ccg acg tat ctg gcg gat gaa cct gaa 288 
Thr Tyr Ala Val Lys Ser Tyr Pro Thr Tyr Leu Ala Asp Glu Pro Glu 
85 90 95 



336 



tct ttt att gtt aat aaa tta ggg gta gtc gat gta ggt gee agt ttg 384 
Ser Phe He Val Asn Lys Leu Gly Val Val Asp Val Gly Ala Ser Leu 
115 120 125 

caa ggg aga gcg gtg tat cac cca tct tgt age ctg gee cgt aag ctg 432 
Gin Gly Arg Ala Val Tyr His Pro Ser Cys Ser Leu Ala Arg Lys Leu 
130 135 140 
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gga gtg aag gac gag cca ctt acg ctg 
Gly Val Lys Asp Glu Pro Leu Thr Leu 
145 150 

gag ctg ttg acc ttt get gaa cag gat 
Glu Leu Leu Thr Phe Ala Glu Gin Asp 
165 

acg ttc teg gtc aaa atg gec gaa ata 
Thr Phe Ser Val Lys Met Ala Glu He 
180 185 

aag gtt gcg cac ctg atg gaa gtc cgc 
Lys Val Ala His Leu Met Glu Val Arg 
195 200 

gac gtg agt tgc ctg eta aac ate agt 
Asp Val Ser Cys Leu Leu Asn He Ser 
210 215 

cag aaa gtc aaa gtg atg cat att get 
Gin Lys Val Lys Val Met His He Ala 
225 230 



ctg aaa aat gtg cgt gga ctg 4 80 

Leu Lys Asn Val Arg Gly Leu 

155 160 

acc tgc tgc gga ttt ggc ggc 528 

Thr Cys Cys Gly Phe Gly Gly 
170 " 175 

tec ggc gag atg gtg aaa gaa 576 

Ser Gly Glu Met Val Lys Glu 
190 

cct gag tat tta att ggt get 624 

Pro Glu Tyr Leu He Gly Ala 
205 

ggg cga tta caa egg gaa ggg 672 

Gly Arg Leu Gin Arg Glu Gly 
220 

gaa gtg ttg atg age cgc tga 720 

Glu Val Leu Met Ser Arg * 
235 



<210> 232 
<211> 1428 
<212> DNA 

<213> Escherichia coli 

<220> 

<221> CDS 

<222> (1) . . . (1428) 

<400> 232 

atg teg ate aaa acc agt aat aca gat ttt aag aca cgc ate cgt cag 48 

Met Ser He Lys Thr Ser Asn Thr Asp Phe Lys Thr Arg He Arg Gin 
15 10 15 

caa att gaa gat ccg ate atg cgc aaa gcg gtg gca aac gcg cag cag 96 
Gin He Glu Asp Pro He Met Arg Lys Ala Val Ala Asn Ala Gin Gin 
20 25 30 

cgt att ggg gca aat egg caa aaa atg gtc gat gaa ttg ggg cac tgg 14 4 

Arg He Gly Ala Asn Arg Gin Lys Met Val Asp Glu Leu Gly His Trp 
35 40 45 

gag gag tgg cgc gat egg gee gee cag ata cgt gat cat gtt ctg agt 192 
Glu Glu Trp Arg Asp Arg Ala Ala Gin He Arg Asp His Val Leu Ser 
50 " 55 60 

aat etc gac get tat ctg tac cag etc tea gaa aaa gtg acg caa aac 240 
Asn Leu Asp Ala Tyr Leu Tyr Gin Leu Ser Glu Lys Val Thr Gin Asn 
65 70 75 80 

ggc ggt cac gtc tat ttt gca aga acc aaa gaa gac get acc cgc tac 288 
Gly Gly His Val Tyr Phe Ala Arg Thr Lys Glu Asp Ala Thr Arg Tyr 
85 90 95 

att tta cag gtt gec caa cgc aaa aat gee egg aag gtg gtg aaa tct 336 
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lie Leu Gin Val Ala Gin Arg Lys Asn Ala Arg Lys Val Val Lys Ser 
100 105 110 

aaa teg atg gtg acc gaa gag att ggt gtc aat cat gtg ttg cag gat 384 
Lys Ser Met Val Thr Glu Glu lie Gly Val Asn His Val Leu Gin Asp 
115 120 125 



get ggc att cag gtg att gaa acc gat ctg ggt gaa tat att etc cag 

Ala Gly lie Gin Val He Glu Thr Asp Leu Gly Glu Tyr He Leu Gin 
130 135 140 

ctg gat caa gat ccg cca tct cat gtt gtg gtc ccg gca att cat aaa 

Leu Asp Gin Asp Pro Pro Ser His Val Val Val Pro Ala He His Lys 
145 150 155 160 

gat cgc cat cag ate cgt cga gtg eta cac gaa cgt ctg ggc tat gag 

Asp Arg His Gin He Arg Arg Val Leu His Glu Arg Leu Gly Tyr Glu 

165 170 175 



cgc gaa gat ttc etc agt get gaa ata ggt att acc ggc tgt aat ttc 
Arg Glu Asp Phe Leu Ser Ala Glu He Gly He Thr Gly Cys Asn Phe 
195 200 205 

gcg gtg gca gag acc ggt teg gta tgc ctg gtg acc aat gaa ggt aat 
Ala Val Ala Glu Thr Gly Ser Val Cys Leu Val Thr Asn Glu Gly Asn 
210 215 220 

gcg cga atg tgt acc acg ctg cct aaa acg cat att gca gtg atg gga 
Ala Arg Met Cys Thr Thr Leu Pro Lys Thr His He Ala Val Met Gly 
225 " 230 235 240 

atg gag cgt att gee ccc acg ttt gec gag gta gat gta ttg ate acc 
Met Glu Arg He Ala Pro Thr Phe Ala Glu Val Asp Val Leu He Thr 
245 250 255 

atg ctg gcg cgc agt gee gtt ggt gca cgt ttg acg gga tac aac acc 
Met Leu Ala Arg Ser Ala Val Gly Ala Arg Leu Thr Gly Tyr Asn Thr 
260 265 270 

tgg ctg aca gga ccg cgc gaa get ggg cac gtt gat ggt cct gaa gag 
Trp Leu Thr Gly Pro Arg Glu Ala Gly His Val Asp Gly Pro Glu Glu 
275 280 285 



gaa ttt egg gat gtg ctg cgc tgt att cgc tgc ggg get tgt atg aat 
Glu Phe Arg Asp Val Leu Arg Cys He Arg Cys Gly Ala Cys Met Asn 
305 310 315 320 

act tgt ccg gca tat cgc cat att ggc ggt cat gga tat ggc tct att 
Thr Cys Pro Ala Tyr Arg His He Gly Gly His Gly Tyr Gly Ser He 
325 330 335 

tat cca ggg cca att ggt gcg gtg att tct ccg eta ctt ggc ggc tat 



432 



480 



528 



ggg ccg gaa acg cct gaa gcg atg acc tta ttc ate egg caa aaa ate 576 
Gly Pro Glu Thr Pro Glu Ala Met Thr Leu Phe He Arg Gin Lys He 
180 185 190 



624 



672 



720 



768 



816 



864 



ttt cat ctg gtt att gtc gat aac ggg cgt tct gag gtg ctg gee tct 912 
Phe His Leu Val He Val Asp Asn Gly Arg Ser Glu Val Leu Ala Ser 
290 295 300 



960 



1008 



1056 
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Tyr Pro Gly Pro He Gly Ala Val He Ser Pro Leu Leu Gly Gly Tyr 

340 345 350 

aaa gat ttt aaa gat tta ccc tac gcc tgc tct tta tgc aca get tgt 1104 

Lys Asp Phe Lys Asp Leu Pro Tyr Ala Cys Ser Leu Cys Thr Ala Cys 

355 ~ 360 365 

gac aac gtg tgt ccg gtg cgt att ccg ctg tea aaa ctg att ttg cgt 1152 

Asp Asn Val Cys Pro Val Arg He Pro Leu Ser Lys Leu He Leu Arg 

370 375 380 

cat cgt egg gtg atg get gaa aaa ggg ate ace gca aaa gca gag caa 1200 

His Arg Arg Val Met Ala Glu Lys Gly He Thr Ala Lys Ala Glu Gin 

385 390 395 400 

egg gcg ata aaa atg ttc get tat gcc aat agt cat cca gga ttg tgg 1248 

Arg Ala He Lys Met Phe Ala Tyr Ala Asn Ser His Pro Gly Leu Trp 

405 410 415 

aaa gtc ggg atg atg gcc ggt get cat gcg gca age tgg ttt ate aat 1296 

Lys Val Gly Met Met Ala Gly Ala His Ala Ala Ser Trp Phe He Asn 

420 425 430 

ggc ggc aaa aca cca etc aaa ttt ggc gcg att age gac tgg atg gaa 134 4 

Gly Gly Lys Thr Pro Leu Lys Phe Gly Ala He Ser Asp Trp Met Glu 

4 35 4 40 44 5 

gca cgc gat ctt cct gaa get gac gga gag agt ttc cgt agt tgg ttt 1392 

Ala Arg Asp Leu Pro Glu Ala Asp Gly Glu Ser Phe Arg Ser Trp Phe 

450 455 460 

aag aaa cat cag gcg cag gag aaa aag aat gga taa 1428 

Lys Lys His Gin Ala Gin Glu Lys Lys Asn Gly * 

465 470 475 



<210> 233 
<211> 849 
<212> DNA 

<213> Escherichia coli 

<220> 

<221> CDS 

<222> (1) . . . (849) 

<400> 233 

gtg etc atg egg caa get ggt tta tea atg gcg gca aaa cac cac tea 48 
Met Leu Met Arg Gin Ala Gly Leu Ser Met Ala Ala Lys His His Ser 
15 10 15 

aat ttg gcg cga tta gcg act gga tgg aag cac gcg ate ttc ctg aag 96 
Asn Leu Ala Arg Leu Ala Thr Gly Trp Lys His Ala He Phe Leu Lys 
20 25 30 

ctg acg gag aga gtt tec gta gtt ggt tta aga aac ate agg cgc agg 144 
Leu Thr Glu Arg Val Ser Val Val Gly Leu Arg Asn He Arg Arg Arg 
35 40 45 

aga aaa aga atg gat aat cga ggc gaa ttt ttg aat aac gtt get cag 192 
Arg Lys Arg Met Asp Asn Arg Gly Glu Phe Leu Asn Asn Val Ala Gin 
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50 55 60 

gca ctg ggt cgc ccg ctg cga ctt gaa ccg caa gca gaa gat gcg ccg 240 
Ala Leu Gly Arg Pro Leu Arg Leu Glu Pro Gin Ala Glu Asp Ala Pro 
65 70 75 80 

ctt aac aac tat get aac gag egg ctt acc caa ctt aac caa cag cag 288 
Leu Asn Asn Tyr Ala Asn Glu Arg Leu Thr Gin Leu Asn Gin Gin Gin 
85 90 95 

cgc tgt gac gcg ttt att cag ttt gec age gat gtt atg ttg acg cgc 336 
Arg Cys Asp Ala Phe He Gin Phe Ala Ser Asp Val Met Leu Thr Arg 
100 105 110 



tgt gag ctg acc age gag gcg aag gcg gca gaa get gca ata cgt ctg 
Cys Glu Leu Thr Ser Glu Ala Lys Ala Ala Glu Ala Ala He Arg Leu 
115 120 125 



ggt gaa cga atg cct tec tgc att aac ate att age ggc ccc agt tea 

Gly Glu Arg Met Pro Ser Cys He Asn He He Ser Gly Pro Ser Ser 
245 250 255 

acg gcg gat att gag ctt ate aaa gtc gtc gga gtt cat ggc ccg gtg 

Thr Ala Asp He Glu Leu He Lys Val Val Gly Val His Gly Pro Val 
260 265 270 

aaa gcg gtg tat ctg att att gag gat- tgt tga 

Lys Ala Val Tyr Leu He He Glu Asp Cys * 

275 280 



<210> 234 



384 



576 



tgt aaa gag ctg gga gat cag teg gtc gtg att age ggt gac acg agg 4 32 

Cys Lys Glu Leu Gly Asp Gin Ser Val Val He Ser Gly Asp Thr Arg 
130 135 140 

ctg gag gaa ttg ggg att age gaa cgt ttg cag cag gaa tgc aat gee 4 80 

Leu Glu Glu Leu Gly He Ser Glu Arg Leu Gin Gin Glu Cys Asn Ala 
145 150 155 160 

gtt gtt tgg gat ccg gcg aaa ggt gee gag aat ate teg cag gca gag 528 
Val Val Trp Asp Pro Ala Lys Gly Ala Glu Asn He Ser Gin Ala Glu 
165 170 175 

cag get aaa gtg ggt gtt gtg tat get gaa tat ggt tta acc gaa teg 
Gin Ala Lys Val Gly Val Val Tyr Ala Glu Tyr Gly Leu Thr Glu Ser 
180 ^ 185 190 

gga ggc gtg gtt ctt ttt tec gee gee gag cgc ggg cgt tea ttg age 624 
Gly Gly Val Val Leu Phe Ser Ala Ala Glu Arg Gly Arg Ser Leu Ser 
195 200 205 

ctg etc ccg gaa tat tct ctt ttt ate ctg cgt aaa age act ate ctg 672 
Leu Leu Pro Glu Tyr Ser Leu Phe He Leu Arg Lys Ser Thr He Leu 
210 215 220 

ccg cgt gta gcg caa etc gca gaa aaa ttg cat cag aaa gcg cag gee 720 
Pro Arg Val Ala Gin Leu Ala Glu Lys Leu His Gin Lys Ala Gin Ala 
225 230 235 240 



768 



816 



849 
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<211> 1173 
<212> DNA 

<213> Escherichia coli 

<220> 

<221> CDS 

<222> (1) . . . (1173) 

<400> 234 

ttg age gag ata gcg atg cac gtt aca gcc aag ccc tec agt ttt caa 

Met Ser Glu He Ala Met His Val Thr Ala Lys Pro Ser Ser Phe Gin 

15 10 15 



ttg eta gaa acc ggg acg ccg aat att gat ttc agt ggt cat agt gag 
Leu Leu Glu Thr Gly Thr Pro Asn He Asp Phe Ser Gly His Ser Glu 
195 200 205 



48 



tgt aat etc aaa tgt gat tac tgt ttt tac ctt gaa aaa gag teg cag 96 
Cys Asn Leu Lys Cys Asp Tyr Cys Phe Tyr Leu Glu Lys Glu Ser Gin 
20 25 30 

ttt act cat gaa aaa tgg atg gat gac age act ctg aaa gag ttc ate 144 
Phe Thr His Glu Lys Trp Met Asp Asp Ser Thr Leu Lys Glu Phe He 
35 40 45 

aaa caa tat ate gca gcg tct ggc aat cag gtc tat ttt acc tgg caa 192 
Lys Gin Tyr He Ala Ala Ser Gly Asn Gin Val Tyr Phe Thr Trp Gin 
50 55 60 

ggc ggt gaa ccc act ctg get ggc ctg gat ttt ttc cgt aaa gtt att 24 0 

Gly Gly Glu Pro Thr Leu Ala Gly Leu Asp Phe Phe Arg Lys Val He 
65 70 75 80 

cac tat caa caa cgc tat gca ggc caa aaa cgt att ttt aat gca tta 288 
His Tyr Gin Gin Arg Tyr Ala Gly Gin Lys Arg He Phe Asn Ala Leu 
85 90 95 

caa acg aat ggc att tta ttg aat aat gaa tgg tgt gcc ttt etc aaa 336 
Gin Thr Asn Gly He Leu Leu Asn Asn Glu Trp Cys Ala Phe Leu Lys 
100 105 110 

gaa cat gaa ttt ctg gtt ggt ate teg ate gat ggc ccc cag gag tta 384 
Glu His Glu Phe Leu Val Gly He Ser lie Asp Gly Pro Gin Glu Leu 
115 120 125 

cat gac cgt tac aga cgc agt aat tea ggt aac ggt act ttt gca aaa 432 
His Asp Arg Tyr Arg Arg Ser Asn Ser Gly Asn Gly Thr Phe Ala Lys 
130 135 140 

gtg ata gca gcc ate gag cgt ctg aaa tea tat caa gta gag ttt aat 
Val He Ala Ala He Glu Arg Leu Lys Ser Tyr Gin Val Glu Phe Asn 
145 150 155 160 

acg tta acc gtc att aat aac gtt aat gtc cat tac cct ctt gag gtt 528 
Thr Leu Thr Val He Asn Asn Val Asn Val His Tyr Pro Leu Glu Val 
165 170 175 

tat cat ttt tta aaa tct ate ggc agt aaa cat atg caa ttt ate gaa 57 6 

Tyr His Phe Leu Lys Ser He Gly Ser Lys His Met Gin Phe He Glu 
180 ' 185 190 



480 



624 
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aac aca ttc cgt ate att gat ttt tct gtg cct ccc acg get tat ggc 
Asn Thr Phe Arg He He Asp Phe Ser Val Pro Pro Thr Ala Tyr Gly 
210 215 220 

aag ttt atg tea ace att ttt atg caa tgg gtt aaa aac gat gtg ggt 
Lys Phe Met Ser Thr He Phe Met Gin Trp Val Lys Asn Asp Val Gly 
225 230 235 240 



gtt gtt gaa agt aat gga gac att tac gaa tgc gac cat ttt gtc tat 
Val Val Glu Ser Asn Gly Asp He Tyr Glu Cys Asp His Phe Val Tyr 
275 280 285 



aaa caa atg gag aat aat taa 
Lys Gin Met Glu Asn Asn * . 
385 390 



672 



720 



gaa att ttc ate cgt cag ttt gaa age ttt gtc age cgt ttt ttg ggg 768 
Glu He Phe He Arg Gin Phe Glu Ser Phe Val Ser Arg Phe Leu Gly 
245 250 255 

aat ggg cat ace agt tgt att ttc cag gag tec tgc aag gat aat ctg 816 
Asn Gly His Thr Ser Cys He Phe Gin Glu Ser Cys Lys Asp Asn Leu 
260 265 270 



864 



cca cag tac aaa att gga aac att aat aaa tct gaa etc aaa acg atg 912 
Pro Gin Tyr Lys He Gly Asn He Asn Lys Ser Glu Leu Lys Thr Met 
290 295 300 

aac agt gta caa ctg aca gcg caa aaa aaa egg att cca gcg aaa tgt 
Asn Ser Val Gin Leu Thr Ala Gin Lys Lys Arg lie Pro Ala Lys Cys 
305 310 315 320 

cag caa tgt gca tat aaa cct ate tgc aat ggc ggt tgt cct aag cat 
Gin Gin Cys Ala Tyr Lys Pro He Cys Asn Gly Gly Cys Pro Lys His 
325 330 335 

cgt att act aaa gta aac aat gag act gtt tct tat ttt tgc gaa ggt 
Arg He Thr Lys Val Asn Asn Glu Thr Val Ser Tyr Phe Cys Glu Gly 
340 345 350 



960 



1008 



1056 



tat aaa ate ctt ttt tea acc atg gta cct tat atg aac gee atg gta 1104 
Tyr Lys He Leu Phe Ser Thr Met Val Pro Tyr Met Asn Ala Met Val 
355 360 365 

gag tta get aag aac aga gta ccg ctt tac cac att atg gat gtt gca 1152 
Glu Leu Ala Lys Asn Arg Val Pro Leu Tyr His He Met Asp Val Ala 
370 375 380 



1173 



<210> 235 
<211> 1716 
<212> DNA 

<213> Escherichia coli 

<220> 

<221> CDS 

<222> (1) . . . (1716) 



<400> 235 
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ttg gaa tta agt aca ata aga aga gga aca ttt atg aag tct gca tta 4 8 

Met Glu Leu Ser Thr He Arg Arg Gly Thr Phe Met Lys Ser Ala Leu 
15 10 15 



aag aaa agt gtc gta agt acc teg ata tct ttg ata ctg gca tct ggt 
Lys Lys Ser Val Val Ser Thr Ser He Ser Leu He Leu Ala Ser Gly 
20 25 30 



acc aaa aca aac gtt get ttc tea gac ttt acg ccg aca gaa tac agt 
Thr Lys Thr Asn Val Ala Phe Ser Asp Phe Thr Pro Thr Glu Tyr Ser 
50 55 60 



gaa ggc gta cgt ttt act aac ggc tat gtg gca cac ggt gtt tec ggc 

Glu Gly Val Arg Phe Thr Asn Gly Tyr Val Ala His Gly Val Ser Gly 

130 135 140 

ccc tec cgc gec gca ata atg acc ggt cga get ccc gec cgc ttt ggt 

Pro Ser Arg Ala Ala He Met Thr Gly Arg Ala Pro Ala Arg Phe Gly 

145 ~ 150 155 160 

gtc tat tec aat acc gat get cag gat ggt att ccg eta aca gaa act 

Val Tyr Ser Asn Thr Asp Ala Gin Asp Gly He Pro Leu Thr Glu Thr 

165 170 175 

ttc ttg cct gaa tta ttc cag aat cat ggt tat tac act gca gca gta 

Phe Leu Pro Glu Leu Phe Gin Asn His Gly Tyr Tyr Thr Ala Ala Val 

180 185 190 

ggt aaa tgg cac ttg tea aaa ate agt aat gtg ccg gta ccg gaa gat 

Gly Lys Trp His Leu Ser Lys He Ser Asn Val Pro Val Pro Glu Asp 

195 200 205 

aaa caa acg cgt gac tat cat gac aac ttc acc aca ttt tct gcg gaa 

Lys Gin Thr Arg Asp Tyr His Asp Asn Phe Thr Thr Phe Ser Ala Glu 

210 215 220 

gaa tgg caa cct caa aac cgt ggc ttt gat tac ttt atg gga ttc cac 

Glu Trp Gin Pro Gin Asn Arg Gly Phe Asp Tyr Phe Met Gly Phe His 

225 230 235 240 



96 



atg get gca ttt get get cat gcg gca gat gat gta aag ctg aaa gca 14 4 

Met Ala Ala Phe Ala Ala His Ala Ala Asp Asp Val Lys Leu Lys Ala 
35 40 45 



192 



acc aaa gga aag cca aat att ate gta ctg acc atg gat gat ctt ggt 24 0 

Thr Lys Gly Lys Pro Asn He He Val Leu Thr Met Asp Asp Leu Gly 
65 70 75 80 

tat gga caa ctt cct ttt gat aag gga tct ttt gac cca aaa aca atg 
Tyr Gly Gin Leu Pro Phe Asp Lys Gly Ser Phe Asp Pro Lys Thr Met 
85 90 95 



288 



gaa aat cgt gaa gtt gtc gat acc tac aaa ata ggg ata gat aaa gec 336 
Glu Asn Arg Glu Val Val Asp Thr Tyr Lys He Gly He Asp Lys Ala 
100 105 110 

att gaa get gca caa aaa tea acg ccg acg etc ctt tea tta atg gat 38 4 

He Glu Ala Ala Gin Lys Ser Thr Pro Thr Leu Leu Ser Leu Met Asp 
115 120 125 



432 



480 



528 



576 



624 



672 



720 
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get gca gga acg gca tat tac aac tec cct tea ctg ttc aaa aat cgt 768 
Ala Ala Gly Thr Ala Tyr Tyr Asn Ser Pro Ser Leu Phe Lys Asn Arg 
245 * 250 255 

gaa cgt gtc ccc gca aaa ggt tat ate age gat cag tta ace gat gag 816 
Glu Arg Val Pro Ala Lys Gly Tyr lie Ser Asp Gin Leu Thr Asp Glu 
260 265 270 

gca att ggc gtt gtt gat cgt gec aaa aca ctt gac cag cct ttt atg 864 
Ala He Gly Val Val Asp Arg Ala Lys Thr Leu Asp Gin Pro Phe Met 
275 280 285 

ctt tac ctg get tat aat get ccg cac ctg cca aat gat aat cct gca 912 
Leu Tyr Leu Ala Tyr Asn Ala Pro His Leu Pro Asn Asp Asn Pro Ala 
290 295 300 

ccg gat caa tat cag aag caa ttt aat acc ggt agt caa aca gca gat 960 
Pro Asp Gin Tyr Gin Lys Gin Phe Asn Thr Gly Ser Gin Thr Ala Asp 
305 * 310 315 320 

aac tac tac get tec gtt tat tct gtt gat cag ggt gta aaa cgc att 1008 
Asn Tyr Tyr Ala Ser Val Tyr Ser Val Asp Gin Gly Val Lys Arg lie 
325 330 335 

etc gaa caa ctg aag aaa aac gga cag tat gac aat aca att att etc 1056 
Leu Glu Gin Leu Lys Lys Asn Gly Gin Tyr Asp Asn Thr He He Leu 
340 345 350 

ttt acc tec gat aat ggt gcg gtt ate gat ggt cct ctg ccg ctg aac 1104 
Phe Thr Ser Asp Asn Gly Ala Val He Asp Gly Pro Leu Pro Leu Asn 
355 360 365 

ggg gcg caa aaa ggc tat aag agt cag acc tat cct ggc ggt act cac 1152 
Gly Ala Gin Lys Gly Tyr Lys Ser Gin Thr Tyr Pro Gly Gly Thr His 
370 375 380 

acc cca atg ttt atg tgg tgg aaa gga aaa ctt caa ccc ggt aat tat 1200 
Thr Pro Met Phe Met Trp Trp Lys Gly Lys Leu Gin Pro Gly Asn Tyr 
385 390 395 400 

gac aag ctg att tec gca atg gat ttc tac ccg aca get ctt gat gca 1248 
Asp Lys Leu He Ser Ala Met Asp Phe Tyr Pro Thr Ala Leu Asp Ala 
405 410 415 

gee gat ate age att cca aaa gac ctt aag ctg gat ggc gtt tec ttg 1296 
Ala Asp He Ser lie Pro Lys Asp Leu Lys Leu Asp Gly Val Ser Leu 
420 425 430 

ctg ccc tgg ttg caa gat aag aaa caa ggc gag cca cat aaa aat ctg 1344 
Leu Pro Trp Leu Gin Asp Lys Lys Gin Gly Glu Pro His Lys Asn Leu 
435 440 445 

acc tgg ata acc tct tat tct cac tgg ttt gac gag gaa aat att cca • 1392 
Thr Trp He Thr Ser Tyr Ser His Trp Phe Asp Glu Glu Asn lie Pro 
450 455 460 

ttc tgg gat aat tac cac aaa ttt gtt cgc cat cag tea gac gat tac 14 40 

Phe Trp Asp Asn Tyr His Lys Phe Val Arg His Gin Ser Asp Asp Tyr 
465 470 475 480 
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ccg cat aac ccc aac act gag gac tta age caa ttc tct tat acg gtg 1488 
Pro His Asn Pro Asn Thr Glu Asp Leu Ser Gin Phe Ser Tyr Thr Val 
485 490 495 

aga aat aac gat tat teg ctt gtc tat aca gta gaa aac aat cag tta 1536 
Arg Asn Asn Asp Tyr Ser Leu Val Tyr Thr Val Glu Asn Asn Gin Leu 
500 505 510 

ggt etc tac aaa ctg acg gat eta cag caa aaa gat aac ctt gcc gcc 1584 
Gly Leu Tyr Lys Leu Thr Asp Leu Gin Gin Lys Asp Asn Leu Ala Ala 
515 520 525 

gcc aat ccg cag gtc gtt aaa gag atg caa ggc gtg gta aga gag ttt 1632 
Ala Asn Pro Gin Val Val Lys Glu Met Gin Gly Val Val Arg Glu Phe 
530 535 540 

ate gac age age cag cca ccg ctt age gag gta aat cag gag aag ttt 1680 
lie Asp Ser Ser Gin Pro Pro Leu Ser Glu Val Asn Gin Glu Lys Phe 
545 550 555 560 

aac aat ate aag aaa gca eta age gaa gcg aaa taa 1716 
Asn Asn lie Lys Lys Ala Leu Ser Glu Ala Lys * 
565 570 



<210> 236 
<211> 825 
<212> DNA 

<213> Escherichia coli 

<220> 

<221> CDS 

<222> (1) . . . (825) 

<400> 236 

atg acc gaa ttt aca act ctt ctt cag caa gga aac gcc tgg ttc ttc 48 

Met Thr Glu Phe Thr Thr Leu Leu Gin Gin Gly Asn Ala Trp Phe Phe 

15 10 15 

ate ccc age gcc ate tta ctt ggt gcg ctt cat ggc ctg gaa cca ggg 96 
lie Pro Ser Ala lie Leu Leu Gly Ala Leu His Gly Leu Glu Pro Gly 
20 25 30 

cac tea aaa acg atg atg gcg gcg ttt ate ate gcc ate aaa ggc acc 144 
His Ser Lys Thr Met Met Ala Ala Phe He He Ala He Lys Gly Thr 
35 40 45 

att aaa caa gcg gtg atg etc gga ctg gca gca act att teg cat acc 192 
He Lys Gin Ala Val Met Leu Gly Leu Ala Ala Thr He Ser His Thr 
50 55 60 

gca gtg gtc tgg tta att gcc ttt ggc ggg atg gtg ate age aag cgc 240 
Ala Val Val Trp Leu lie Ala Phe Gly Gly Met Val He Ser Lys Arg 
65 70 75 80 

ttt act get caa tea gca gaa ccg tgg etc cag ctg att tec gca gtg 288 
Phe Thr Ala Gin Ser Ala Glu Pro Trp Leu Gin Leu He Ser Ala Val 
85 90 95 

ate att att age acc gcg ttc tgg atg ttc tgg cgt acc tgg cgc ggc 336 
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lie He lie Ser Thr Ala Phe Trp Met Phe Trp Arg Thr Trp Arg Gly 
100 105 HO 

gaa cgc aac tgg ctg gag aat atg cac ggg cat gat tat gag cat cat 
Glu Arg Asn Trp Leu Glu Asn Met His Gly His Asp Tyr Glu His His 
115 " 120 125 



gaa cat ggc gag tat cag gat gcc cat gca cga gcc cat gcc aat gac 
Glu His Gly Glu Tyr Gin Asp Ala His Ala Arg Ala His Ala Asn Asp 
145 150 155 160 



384 



cat cac gat cac gaa cat cac cac gac cat gga cat cat cac cat cac 4 32 

His His Asp His Glu His His His Asp His Gly His His His His His 
130 135 140 



480 



att aaa cga cgc ttt gat ggt aga gag gtc acc aac tgg caa att ttg 528 
He Lys Arg Arg Phe Asp Gly Arg Glu Val Thr Asn Trp Gin He Leu 
165 1"70 175 

tta ttt ggc tta acc ggt ggc ctt ate ccc tgc ccg gca gca att acc 576 
Leu Phe Gly Leu Thr Gly Gly Leu He Pro Cys Pro Ala Ala He Thr 
180 185 190 

gtg ctg ttg ata tgc att cag ttg aaa gcc ctg aca ctg ggc gca aca 624 
Val Leu Leu He Cys He Gin Leu Lys Ala Leu Thr Leu Gly Ala Thr 
195 200 205 

ctg gtc gtc agt ttc age att ggc ctg gcg tta acg ctt gtc acc gta 672 
Leu Val Val Ser Phe Ser He Gly Leu Ala Leu Thr Leu Val Thr Val 
210 215 220 

ggc gtt ggc gca gca ate age gtt cag cag gtc gca aaa cgc tgg age 720 
Gly Val Gly Ala Ala He Ser Val Gin Gin Val Ala Lys Arg Trp Ser 
225 230 235 240 

gga ttt aac act etc get aaa cgc gcc ccc tat ttt tec agt ctg ttg 768 
Gly Phe Asn Thr Leu Ala Lys Arg Ala Pro Tyr Phe Ser Ser Leu Leu 
245 250 255 

att ggc tta gtc ggt gtg tat atg ggc gta cat ggc ttc atg ggc ata 816 
He Gly Leu Val Gly Val Tyr Met Gly Val His Gly Phe Met Gly He 
260 265 270 

atg cga taa 825 
Met Arg * 



<210> 237 
<211> 519 
<212> DNA 

<213> Escherichia coli 

<220> 

<221> CDS 

<222> (1) . . . (519) 

<400> 237 

atg att ctt aaa tea get att tec get gac tct ctg etc get aag gat 48 
Met He Leu Lys Ser Ala He Ser Ala Asp Ser Leu Leu Ala Lys Asp 
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15 10 15 

get ttt agg gca tec ttt cat tta cac ttt tta cga aat cat ggg ate 96 

Ala Phe Arg Ala Ser Phe His Leu His Phe Leu Arg Asn His Gly lie 
20 25 30 

act aac aaa ata teg ctt gtc agt tat att gta tgg cag gaa aga tat 144 

Thr Asn Lys He Ser Leu Val Ser Tyr He Val Trp Gin Glu Arg Tyr 
35 40 45 

gcg act gat att aca gat ccc caa agt gga gag ttt atg ace att aaa 192 

Ala Thr Asp He Thr Asp Pro Gin Ser Gly Glu Phe Met Thr He Lys 
50 55 60 

aat aag atg ttg ctg ggt gcg ctt ttg ctg gtt acc agt gee gec tgg 240 

Asn Lys Met Leu Leu Gly Ala Leu Leu Leu Val Thr Ser Ala Ala Trp 

65 70 75 80 

gee gca cca gec acc gcg ggt teg acc aat acc teg gga att tct aag 288 

Ala Ala Pro Ala Thr Ala Gly Ser Thr Asn Thr Ser Gly He Ser Lys 
85 90 95 

tat gag tta agt agt ttc att get gac ttt aag cat ttc aaa cca ggg 336 

Tyr Glu Leu Ser Ser Phe He Ala Asp Phe Lys His Phe Lys Pro Gly 
100 105 110 

gac acc gta cca gaa atg tac cgt acc gat gag tac aac att aag cag 384 

Asp Thr Val Pro Glu Met Tyr Arg Thr Asp Glu Tyr Asn He Lys Gin 
115 120 125 

tgg cag ttg cgt aac ctg ccc gcg cct gat gec ggg acg cac tgg acc 432 

Trp Gin Leu Arg Asn Leu Pro Ala Pro Asp Ala Gly Thr His Trp Thr 

130 135 140 

tat atg ggt ggc gcg tac gtg ttg ate age gac acc gac ggt aaa ate 4 80 

Tyr Met Gly Gly Ala Tyr Val Leu He Ser Asp Thr Asp Gly Lys He 

145 150 155 160 

att aaa gee tac gac ggt gag att ttt tat cat cgc taa 519 

He Lys Ala Tyr Asp Gly Glu He Phe Tyr His Arg * 
165 170 



<210> 238 
<211> 1704 
<212> DNA 

<213> Escherichia coli 

<220> 

<221> CDS 

<222> (1) . . . (1704) 

<400> 238 

atg get ttt gta tea gaa aga gaa att gta aga aag ata ttt age aaa 48 

Met Ala Phe Val Ser Glu Arg Glu He Val Arg Lys He Phe Ser Lys 
15 10 15 

aaa att gac ttt acg ate ttg gca ttt ttc tat att tec tct att ttt 96 
Lys He Asp Phe Thr He Leu Ala Phe Phe Tyr He Ser Ser He Phe 
20 25 30 
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ttt ctt ctt tgt agt ggc gtt etc ttt caa tat ttt acc get gec ttc 144 
Phe Leu Leu Cys Ser Gly Val Leu Phe Gin Tyr Phe Thr Ala Ala Phe 
35 ~ ~ 40 45 

aca aaa ggt aat tgt tat gaa tgc tea atg aaa ctt gat tac ata aaa 192 
Thr Lys Gly Asn Cys Tyr Glu Cys Ser Met Lys Leu Asp Tyr lie Lys 
50 55 60 

caa ttt tat ttt tea ctg gaa acc gca tgg tac etc ata tct get gtt 240 
Gin Phe Tyr Phe Ser Leu Glu Thr Ala Trp Tyr Leu He Ser Ala Val 
65 70 75 80 

gca gta ttt att gca tct gtt ttt ata cag cat aga att aaa get tat 288 
Ala Val Phe He Ala Ser Val Phe lie Gin His Arg He Lys Ala Tyr 
85 * 90 95 

tta aca tta tta get att aca tgg att gta eta aca ata aca gat gtg 336 
Leu Thr Leu Leu Ala He Thr Trp He Val Leu Thr He Thr Asp Val 
100 105 110 

gcg tta ata cac gee tta gac aat ata gee atg aat aat att ttg tta 384 
Ala Leu lie His Ala Leu Asp Asn He Ala Met Asn Asn He Leu Leu 
115 120 125 

aat ata eta tat aat ctt ttt ggg gcg att tta ttg tea ctg ttt atg 432 
Asn lie Leu Tyr Asn Leu Phe Gly Ala He Leu Leu Ser Leu Phe Met 
130 135 140 

tgt tta tea aat tec etc ctt ttc cac ctt aat aag ata aaa cat att 480 
Cys Leu Ser Asn Ser Leu Leu Phe His Leu Asn Lys He Lys His He 
145 150 155 160 

cca atg att tta teg gca atg att cca tta gtt tec gca ata ata ata 528 
Pro Met He Leu Ser Ala Met He Pro Leu Val Ser Ala He He He 
165 170 175 

gee ata etc ata acc get gtt ata tat tta ttg ttt gee egg cag gcg 576 
Ala He Leu He Thr Ala Val He Tyr Leu Leu Phe Ala Arg Gin Ala 
180 185 190 

gta gag att gag atg gat ata tct gaa ggt tea gac ate gca tat gta 624 
Val Glu He Glu Met A3p He Ser Glu Gly Ser Asp He Ala Tyr Val 
195 200 205 

ggt gta aaa gat aat gag gaa tea ttt ggg ttt tta aat gat aaa aaa 672 
Gly Val Lys Asp Asn Glu Glu Ser Phe Gly Phe Leu Asn Asp Lys Lys 
210 " 215 220 

aca gac aca cca acg tat ctg gat gta ate aaa aat ggt agt ttg ata 720 
Thr Asp Thr Pro Thr Tyr Leu Asp Val He Lys Asn Gly Ser Leu He 
225 230 235 240 

tat aat gat aca cag ggt tta agt ggt get gat att tat att gtc tec 7 68 

Tyr Asn Asp Thr Gin Gly Leu Ser Gly Ala Asp He Tyr He Val Ser 
245 250 255 

ggt tgt tat gec ctt cca gat tta ctt cgc aat gta ccg ctt gat get 816 
Gly Cys Tyr Ala Leu Pro Asp Leu Leu Arg Asn Val Pro Leu Asp Ala 
260 265 270 
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aaa aaa tct ttt ctg aat gta aaa aag eta gaa ate aca cag aaa ctt 864 
Lys Lys Ser Phe Leu Asn Val Lys Lys Leu Glu lie Thr Gin Lys Leu 
275 280 285 

cca atg atg ggg ttc ata caa ggt gaa tec get gat gtg atg cct aaa 912 
Pro Met Met Gly Phe lie Gin Gly Glu Ser Ala Asp Val Met Pro Lys 
290 295 300 

gca gca tec agg tta agt ttg age aag caa gat gat aaa ttt atg eta 960 
Ala Ala Ser Arg Leu Ser Leu Ser Lys Gin Asp Asp Lys Phe Met Leu 
305 310 315 320 

gee tea agt gtt act gac tct caa ata aaa ttt aaa tea aac aat gca 1008 
Ala Ser Ser Val Thr Asp Ser Gin He Lys Phe Lys Ser Asn Asn Ala 
325 330 335 

caa ttg atg gtt get ttc gca ttt atg cca ata aca acg aat ggt att 1056 
Gin Leu Met Val Ala Phe Ala Phe Met Pro He Thr Thr Asn Gly He 
340 345 350 

tta cat gat tat aca tac gat ata ata ata aat gat aaa aaa tat aaa 1104 
Leu His Asp Tyr Thr Tyr Asp He He He Asn Asp Lys Lys Tyr Lys 
355 360 365 

att gaa aat cat gtt gca cct eta tct agg ctt gat aaa aat aag aag 1152 
He Glu Asn His Val Ala Pro Leu Ser Arg Leu Asp Lys Asn Lys Lys 
370 375 380 

atg aag tgt gaa tac cag caa ata teg gat tta aca aat acg tat aac 1200 
Met Lys Cys Glu Tyr Gin Gin lie Ser Asp Leu Thr Asn Thr Tyr Asn 
385 390 395 400 

att aat gca aat tac tta acg ggt ttt tta ctt gtc eta aaa cca gat 1248 
lie Asn Ala Asn Tyr Leu Thr Gly Phe Leu Leu Val Leu Lys Pro Asp 
405 410 415 

gat att att aat tac aac aat age cct tec gta eta etc aag act gat 1296 
Asp lie lie Asn Tyr Asn Asn Ser Pro Ser Val Leu Leu Lys Thr Asp 
420 425 430 

ttt gcg ttc tat aaa aaa aca tac cag aag ctt gat aaa ata tat gat 1344 
Phe Ala Phe Tyr Lys Lys Thr Tyr Gin Lys Leu Asp Lys lie Tyr Asp 
435 " 440 445 

gac ata age aat gga aaa tta tct tea ctg aga gca aca ggt ate tec 1392 
Asp He Ser Asn Gly Lys Leu Ser Ser Leu Arg Ala Thr Gly He Ser 
450 J 455 460 

cag ttt agt ate aac ggg aaa cat ttg teg tta agg cca gaa teg gag 1440 
Gin Phe Ser lie Asn Gly Lys His Leu Ser Leu Arg Pro Glu Ser Glu 
465 470 475 480 

att att att tct gaa ggg agt ttg tat gga eta gtt aat aaa age aaa 1488 
lie lie lie Ser Glu Gly Ser Leu Tyr Gly Leu Val Asn Lys Ser Lys 
485 490 495 

aaa ate aaa ata tat ggc aca gca gat ctt gtt ttt gtt gat aac aag 1536 
Lys lie Lys lie Tyr Gly Thr Ala Asp Leu Val Phe Val Asp Asn Lys 
500 505 510 



-192- 



WO 01/34810 



PCT/US00/30950 



ata atg aac ctt cgt aaa ata act tat ttg caa tct aag eta gaa att 1584 
lie Met Asn Leu Arg Lys lie Thr Tyr Leu Gin Ser Lys Leu Glu lie 
515 " 520 525 

ttt ggt tct tct att atg gat ata tta aag tat ata ttt ggt tta ggt 1632 
Phe Gly Ser Ser lie Met Asp lie Leu Lys Tyr lie Phe Gly Leu Gly 
530 535 540 

ctg eta gca att tct ata aaa ttc att cat tct tac ttt aag aat gat 1680 
Leu Leu Ala lie Ser lie Lys Phe He His Ser Tyr Phe Lys Asn Asp 
545 550 555 560 

gtg aat gaa aat tta ttc ttg taa 1704 
Val Asn Glu Asn Leu Phe Leu * 
565 



<210> 239 
<211> 1092 
<212> DNA 

<213> Escherichia coli 



<220> 
<221> CDS 
<222> (1). 



(1092) 



<400> 239 

atg tct aac ttt ate aat att cat gtt ctg ate tct cac age cct tea 4 8 

Met Ser Asn Phe lie Asn He His Val Leu He Ser His Ser Pro Ser 
1 5 10 15 

tgt ctg aac cgc gac gat atg aac atg cag aaa gac get att ttc ggc 96 
Cys Leu Asn Arg Asp Asp Met Asn Met Gin Lys Asp Ala He Phe Gly 
20 25 30 



ggc aaa aga cga gta aga att tea agt caa age ctt aaa cgt gcg atg 
Gly Lys Arg Arg Val Arg He Ser Ser Gin Ser Leu Lys Arg Ala Met 
35 40 45 



144 



cgt aaa agt ggt tat tac gca caa aat att ggt gaa tec agt etc aga 192 

Arg Lys Ser Gly Tyr Tyr Ala Gin Asn He Gly Glu Ser Ser Leu Arg 

50 " ' 55 60 

acc att cat ctt gca caa tta cgt gat gtt ctt egg caa aaa ctt ggt 240 

Thr He His Leu Ala Gin Leu Arg Asp Val Leu Arg Gin Lys Leu Gly 

65 70 75 80 

gaa cgt ttt gac caa aaa ate ate gat aag aca tta gcg ctg etc tec 288 

Glu Arg Phe Asp Gin Lys He He Asp Lys Thr Leu Ala Leu Leu Ser 

85 90 95 



ggt aaa tea gtt gat gaa gee gaa aag att tct gee gat gcg gtt act 
Gly Lys Ser Val Asp Glu Ala Glu Lys lie Ser Ala Asp Ala Val Thr 
100 105 110 

ccc tgg gtt gtg gga gaa ata gee tgg ttc tgt gag cag gtt gca aaa 
Pro Trp Val Val Gly Glu He Ala Trp Phe Cys Glu Gin Val Ala Lys 
115 120 125 



336 



384 
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gca gag get gat aat ctg gat gat aaa aag ctg etc aaa gtt ctt aag 432 
Ala Glu Ala Asp Asn Leu Asp Asp Lys Lys Leu Leu Lys Val Leu Lys 
130 135 140 



gaa gat att gee gec ata cgt gtg aat tta cag cag ggt gtt gat att 

Glu Asp He Ala Ala He Arg Val Asn Leu Gin Gin Gly Val Asp He 

145 150 155 160 

gcg ctt agt gga aga atg gca acc age ggc atg atg act gag ttg gga 

Ala Leu Ser Gly Arg Met Ala Thr Ser Gly Met Met Thr Glu Leu Gly 

165 170 175 



ggt gee tec agg gag cag get ctg gaa att gca acc cat gtt gtt cat 
Gly Ala Ser Arg Glu Gin Ala Leu Glu He Ala Thr His Val Val His 
245 250 255 



gta gac cca att act get caa gtt aaa caa atg cct act tta gaa cag 
Val Asp Pro He Thr Ala Gin Val Lys Gin Met Pro Thr Leu Glu Gin 
340 345 350 



480 



528 



aaa gtt gat ggt gca atg tec att gcg cat gcg ate act act cat cag 57 6 

Lys Val Asp Gly Ala Met Ser He Ala His Ala He Thr Thr His Gin 

180 185 190 

gtt gat tct gat att gac tgg ttc acc get gta gat gat tta cag gaa 624 

Val Asp Ser Asp He Asp Trp Phe Thr Ala Val Asp Asp Leu Gin Glu 

195 200 205 

caa ggt tct gca cat ctg gga act cag gaa ttt tea teg ggt gtt ttt 672 

Gin Gly Ser Ala His Leu Gly Thr Gin Glu Phe Ser Ser Gly Val Phe 

210 215 220 

tat cgt tat gee aac att aac etc get caa ctt cag gaa aat tta ggt 720 

Tyr Arg Tyr Ala Asn He Asn Leu Ala Gin Leu Gin Glu Asn Leu Gly 

225 230 235 240 



768 



atg ctg gca aca gag gtc cct gga gca aaa cag cgt act tat gee get 816 

Met Leu Ala Thr Glu Val Pro Gly Ala Lys Gin Arg Thr Tyr Ala Ala 

260 265 270 

ttt aac cct gcg gat atg gta atg gtt aat ttc tec gat atg cca ctt 

Phe Asn Pro Ala Asp Met Val Met Val Asn Phe Ser Asp Met Pro Leu 

275 280 285 

tct atg gca aat get ttt gaa aaa gcg gtt aaa gcg aaa gat ggc ttt 

Ser Met Ala Asn Ala Phe Glu Lys Ala Val Lys Ala Lys Asp Gly Phe 
290 295 300 



864 



912 



ttg caa ccg tct ata cag gcg ttt aat caa tat tgg gat cgc gtt gee 960 
Leu Gin Pro Ser He Gin Ala Phe Asn Gin Tyr Trp Asp Arg Val Ala 
305 310 315 320 

aat gga tat ggt ctg aac gga get get gcg caa ttc age tta tct gat 1008 
Asn Gly Tyr Gly Leu Asn Gly Ala Ala Ala Gin Phe Ser Leu Ser Asp 
325 330 335 



1056 



tta aaa tec tgg gtt cgt aat aat ggc gag gcg tga 1092 
Leu Lys Ser Trp Val Arg Asn Asn Gly Glu Ala * 
355 360 
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<210> 240 
<211> 366 
<212> DNA 

<213> Escherichia coli 

<220> 

<221> CDS 

<222> (1) . . . (366) 



<400> 240 

atg atg tec acc acg tta ttt aaa gat ttc acc ttc gaa gec get cac 
Met Met Ser Thr Thr Leu Phe Lys Asp Phe Thr Phe Glu Ala Ala His 
15 10 15 



48 



cgc tta cca cac gtc ccg gaa ggg cat aaa tgt ggt cgc ctg cac ggg 
Arg Leu Pro His Val Pro Glu Gly His Lys Cys Gly Arg Leu His Gly 
20 25 30 



96 



cat tec ttt atg gtg cga ctg gaa att acc ggg gaa gtc gat ccg cat 
His Ser Phe Met Val Arg Leu Glu lie Thr Gly Glu Val Asp Pro His 
35 40 45 



144 



acg ggc tgg att ate gat ttc get gaa eta aaa gcg gcg ttt aaa cca 
Thr Gly Trp lie lie Asp Phe Ala Glu Leu Lys Ala Ala Phe Lys Pro 
50 55 60 



192 



acc tac gag cgc etc gat cac cat tat etc aat gat att cca ggt ctg 
Thr Tyr Glu Arg Leu Asp His His Tyr Leu Asn Asp lie Pro Gly Leu 
65 70 75 80 



240 



gaa aac cca acc age gag gtt tta gca aaa tgg att tgg gat cag gtt 
Glu Asn Pro Thr Ser Glu Val Leu Ala Lys Trp lie Trp Asp Gin Val 
85 90 95 



288 



aaa ccc gtt gtg ccg ctg tta agt gcg gtg atg gta aaa gaa acc tgc 
Lys Pro Val Val Pro Leu Leu Ser Ala Val Met Val Lys Glu Thr Cys 
100 105 110 



336 



acc gca ggt tgt ate tat cgc ggc gaa tga 
Thr Ala Gly Cys He Tyr Arg Gly Glu * 

120 



366 





115 




<210> 


241 




<211> 


1302 




<212> 


DNA 




<213> 


Escherichia coli 


<220> 






<221> 


CDS 




<222> 


(1) ... 


(1302) 


<400> 


241 




gtg ttg tgt 


tct gga tat 


Met Leu Cys 


Ser Gly Tyr 


1 




5 



48 



10 



15 



att att att att ggt gec ggt att gca ggc acc get tgc gcg tta cgc 
He He He He Gly Ala Gly He Ala Gly Thr Ala Cys Ala Leu Arg 



96 
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20 25 30 

tgc gcg cga gcg ggt tta tec gtt ttg tta ctg gaa cgc get gaa ate 144 

Cys Ala Arg Ala Gly Leu Ser Val Leu Leu Leu Glu Arg Ala Glu lie 

35 40 4 5 

ccc ggc age aaa aat ctt tec ggc ggg egg tta tat acc cat gca etc 192 

Pro Gly Ser Lys Asn Leu Ser Gly Gly Arg Leu Tyr Thr His Ala Leu 

50 55 60 

gcg gaa etc etc ccg caa ttt cat ctg acc gcg cct ctt gaa cga cgc 240 

Ala Glu Leu Leu Pro Gin Phe His Leu Thr Ala Pro Leu Glu Arg Arg 

65 70 75 80 

ate act cac gaa age ctt tec ctg tta acg ccc gat ggc gta acg acg 288 

He Thr His Glu Ser Leu Ser Leu Leu Thr Pro Asp Gly Val Thr Thr 

85 90 95 

ttt tec age tta cag ccc ggc ggt gaa tec tgg agt gta tta cgt gca 336 

Phe Ser Ser Leu Gin Pro Gly Gly Glu Ser Trp Ser Val Leu Arg Ala 

100 105 110 

cga ttc gat ccg tgg ctg gtt gec gaa gec gaa aaa gaa ggt gtc gaa 384 

Arg Phe Asp Pro Trp Leu Val Ala Glu Ala Glu Lys Glu Gly Val Glu 

115 120 125 

tgc ate ccc gga gcg acg gtg gat gca ctg tat gaa gaa aac ggc aga 432 

Cys He Pro Gly Ala Thr Val Asp Ala Leu Tyr Glu Glu Asn Gly Arg 

130 ^ 135 140 

gtc tgt ggc gtt att tgt ggt gac gat att etc cgc gee cgt tat gtg 480 

Val Cys Gly Val He Cys Gly Asp Asp He Leu Arg Ala Arg Tyr Val 

145 150 * 155 160 

gtg ctg gca gaa ggt gec aac age gtc ctg get gaa cgt cac ggg tta 528 

Val Leu Ala Glu Gly Ala Asn Ser Val Leu Ala Glu Arg His Gly Leu 

165 170 175 

gtg act cgt cct get ggc gaa gcg atg gcg ttg ggg ate aaa gaa gtg 576 

Val Thr Arg Pro Ala Gly Glu Ala Met Ala Leu Gly He Lys Glu Val 

180 185 190 

ctg teg ctg gaa aca tec get att gaa gaa cgt ttt cat ctg gag aat 624 

Leu Ser Leu Glu Thr Ser Ala He Glu Glu Arg Phe His Leu Glu Asn 

195 200 205 

aac gaa ggc gca gcg ttg ctg ttc age ggc agg ate tgt gat gac tta 672 

Asn Glu Gly Ala Ala Leu Leu Phe Ser Gly Arg He Cys Asp Asp Leu 

210 215 220 

ccc ggc ggc gca ttt ctt tat act aat caa caa acg etc teg tta ggg 720 

Pro Gly Gly Ala Phe Leu Tyr Thr Asn Gin Gin Thr Leu Ser Leu Gly 

225 230 235 240 

att gtt tgc ccg etc tct tec ctt acg caa agt cgt gtt ccg gca age 768 
He Val Cys Pro Leu Ser Ser Leu Thr Gin Ser Arg Val Pro Ala Ser 

245 250 255 

gag ctg ctg act cgc ttt aaa gcg cat ccg gca gtg cgc ccg ctt ate 816 

Glu Leu Leu Thr Arg Phe Lys Ala His Pro Ala Val Arg Pro Leu He 
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260 265 270 

aaa aac acg gaa tea ctg gag tat ggt gcg cat ctg gtg cca gaa ggt 864 

Lys Asn Thr Glu Ser Leu Glu Tyr Gly Ala His Leu Val Pro Glu Gly 
275 280 285 

ggc ttg cac agt atg ccg gtg caa tac gec ggt aac ggc tgg ctg ctg 912 

Gly Leu His Ser Met Pro Val Gin Tyr Ala Gly Asn Gly Trp Leu Leu 

290 295 300 



gtg ggc gat gcg ttg cgc agt tgc gtc aat acc gga att tec gtg cgc 
Val Gly Asp Ala Leu Arg Ser Cys Val Asn Thr Gly He Ser Val Arg 
305 310 315 320 

ggc atg gat atg gcg ctg act ggc gcg cag gcg gcg gca caa acg ctg 
Gly Met Asp Met Ala Leu Thr Gly Ala Gin Ala Ala Ala Gin Thr Leu 
325 330 335 

ata age gec tgc cag cac cgc gag ccg caa aat ctg ttt ccg ctt tat 
He Ser Ala Cys Gin His Arg Glu Pro Gin Asn Leu Phe Pro Leu Tyr 
340 345 350 

cat cac aac gta gag cgc age ctg ctg tgg gat gtt eta cag cgt tat 
His His Asn Val Glu Arg Ser Leu Leu Trp Asp Val Leu Gin Arg Tyr 
355 360 365 



cct gcg tta atg cag gat att tec cgc gat tta tgg gat cag ggt gat 
Pro Ala Leu Met Gin Asp He Ser Arg Asp Leu Trp Asp Gin Gly Asp 
385 390 395 400 



960 



1008 



1056 



1104 



cag cat gtt ccg gcg ctt ttg caa cgc ccg gga tgg tac cgt acg tgg 1152 
Gin His Val Pro Ala Leu Leu Gin Arg Pro Gly Trp Tyr Arg Thr Trp 
370 375 380 



1200 



aaa cct gtt cca ccg ctg cgc cag tta ttc tgg cat cat tta cgt cgt 1248 
Lys Pro Val Pro Pro Leu Arg Gin Leu Phe Trp His His Leu Arg Arg 
405 410 415 

cac ggc ctg tgg cat ctg gcg ggc gat gtt ate agg agt ctg cga tgt 1296 
His Gly Leu Trp His Leu Ala Gly Asp Val He Arg Ser Leu Arg Cys 
420 425 430 

ctg tag 1302 
Leu * 



<210> 242 
<211> 297 
<212> DNA 

<213> Escherichia coli 

<220> 

<221> CDS 

<222> (1) ... (297) 

<400> 242 

gtg gca tct ggc ggg cga tgt tat cag gag tct gcg atg tct gta gec 

Met Ala Ser Gly Gly Arg Cys Tyr Gin Glu Ser Ala Met Ser Val Ala 
15 10 15 
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cgt aat etc tgg cgc gtt get gat gcg ccg cac att gtt ccg get gac 96 
Arg Asn Leu Trp Arg Val Ala Asp Ala Pro His lie Val Pro Ala Asp 
20 25 30 

tec gtt gag cgc cag acg gca gaa egg ttg att aac gec tgt ccg gca 144 
Ser Val Glu Arg Gin Thr Ala Glu Arg Leu lie Asn Ala Cys Pro Ala 
35 40 45 

ggt ctt ttt teg etc aca ccg gaa ggt aac tta cgt att gac tat cgc 192 
Gly Leu Phe Ser Leu Thr Pro Glu Gly Asn Leu Arg lie Asp Tyr Arg 
50 55 60 

agt tgc ctg gag tgt ggc acc tgc cgt ttg ctg tgc gac gaa tea aca 24 0 

Ser Cys Leu Glu Cys Gly Thr Cys Arg Leu Leu Cys Asp Glu Ser Thr 
65 70 75 80 

eta caa cag tgg cgc tat ccg cct tec gga ttc ggc ate acc tac cgc 28 8 

Leu Gin Gin Trp Arg Tyr Pro Pro Ser Gly Phe Gly lie Thr Tyr Arg 
85 90 95 

ttt gga taa 297 
Phe Gly * 



<210> 243 
<211> 576 
<212> DNA 

<213> Escherichia coli 

<220> 

<221> CDS 

<222> (1)...(576) 

<400> 243 

atg ccc etc tta cac ctg etc cgc cag aat ccg gtg att get gee gtt 48 

Met Pro Leu Leu His Leu Leu Arg Gin Asn Pro Val lie Ala Ala Val 
15 10 15 

aaa gat aat gee age ctg caa ctg gca ate gat tct gaa tgc caa ttt 96 
Lys Asp Asn Ala Ser Leu Gin Leu Ala lie Asp Ser Glu Cys Gin Phe 
20 25 30 

att tec gtg ttg tac ggc aat ate tgc acc ate agt aat ate gtt aaa 144 
lie Ser Val Leu Tyr Gly Asn lie Cys Thr lie Ser Asn lie Val Lys 
35 40 45 

aag att aag aac gee ggg aaa tat get ttt att cat gtt gat ttg ctg 192 
Lys He Lys Asn Ala Gly Lys Tyr Ala Phe He His Val Asp Leu Leu 
50 55 60 

gaa ggc gcg tea aat aaa gaa gtc gtt att cag ttt ttg aaa ctg gtg 240 
Glu Gly Ala Ser Asn Lys Glu Val Val He Gin Phe Leu Lys Leu Val 
65 70 75 80 

acc gag gcg gac ggc att ate agt acc aaa gee tea atg ttg aaa gcg 288 
Thr Glu Ala Asp Gly He He Ser Thr Lys Ala Ser Met Leu Lys Ala 
85 90 95 
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gca aga gcg gaa ggt ttt ttc tgt att cat cgc ctg ttt att gtt gat 336 
Ala Arg Ala Glu Gly Phe Phe Cys lie His Arg Leu Phe lie Val Asp 
100 105 110 

tea att teg ttt cac aac att gat aag caa gtt gcg caa teg aat ccg 384 
Ser He Ser Phe His Asn He Asp Lys Gin Val Ala Gin Ser Asn Pro 
115 120 125 

gat tgt att gag ate ctg cca ggc tgt atg ccc aaa gtg ctg ggc tgg 432 
Asp Cys lie Glu He Leu Pro Gly Cys Met Pro Lys Val Leu Gly Trp 
130 135 140 

gtg aca gag aaa ate cgc caa ccg ctg att gee ggt ggg ctg gtg tgc 4 80 

Val Thr Glu Lys He Arg Gin Pro Leu He Ala Gly Gly Leu Val Cys 
145 * 150 155 160 

gat gaa gaa gat gcg cgt aat gcg att aac gcg ggt gtc gtg gcg ctt 528 
Asp Glu Glu Asp Ala Arg Asn Ala He Asn Ala Gly Val Val Ala Leu 
165 170 175 

tec ace acg aat acc ggg gtc tgg acg tta gcg aaa aaa tta ctt tga 576 
Ser Thr Thr Asn Thr Gly Val Trp Thr Leu Ala Lys Lys Leu Leu * 
180 185 190 



<210> 244 
<211> 405 
<212> DNA 

<213> Escherichia coli 

<220> 

<221> CDS 

<222> (1) . . . (405) 

<400> 244 

atg ttc atg acc tgg gaa tat gcg eta att ggg tta gtc gtc ggc ate 48 

Met Phe Met Thr Trp Glu Tyr Ala Leu He Gly Leu Val Val Gly He 
15 10 15 

att att ggt get gtg gee atg cgt ttt ggt aat cgt aaa eta cgc cag 96 
He He Gly Ala Val Ala Met Arg Phe Gly Asn Arg Lys Leu Arg Gin 
20 25 30 

caa cag gcg ttg cag tac gaa ctg gaa aag aat aaa get gaa ctg gac 144 
Gin Gin Ala Leu Gin Tyr Glu Leu Glu Lys Asn Lys Ala Glu Leu Asp 
35 40 45 

gag tat cgc gaa gag ctg gtt age cac ttt gee cgc age gcg gaa tta 192 
Glu Tyr Arg Glu Glu Leu Val Ser His Phe Ala Arg Ser Ala Glu Leu 
50 55 60 

ctg gat acc atg gcg cac gac tat cgc cag ctg tat cag cac atg gca 240 
Leu Asp Thr Met Ala His Asp Tyr Arg Gin Leu Tyr Gin His Met Ala 
65 70 75 80 



aaa age tec age age ctg ctg ccg gaa ctg Let get gaa gca aac ccg 
Lys Ser Ser Ser Ser Leu Leu Pro Glu Leu Ser Ala Glu Ala Asn Pro 
85 90 95 



288 



ttc cgt aat cgt ctg gca gag tct gaa gee age aac gat cag gca ccg 336 
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Phe Arg Asn Arg Leu Ala Glu Ser Glu Ala Ser Asn Asp Gin Ala Pro 
100 105 110 



gtg cag atg cct cgc gac tat tct gaa ggc gca tec ggc ctg ctg cgt 
Val Gin Met Pro Arg Asp Tyr Ser Glu Gly Ala Ser Gly Leu Leu Arg 
115 120 125 



384 



act ggc gcg aag cgc gac taa 
Thr Gly Ala Lys Arg Asp * 
130 



405 



<210> 245 
<211> 1368 
<212> DNA 

<213> Escherichia coli 



<220> 
<221> CDS 
<222> (1) . 



(1368) 



<400> 245 

atg aaa aaa caa acc cag ctg ttg agt gca tta gcg tta agt gtc ggg 

Met Lys Lys Gin Thr Gin Leu Leu Ser Ala Leu Ala Leu Ser Val Gly 
1 " 5 10 15 



48 



tta act etc teg gcg tea ttt cag gee gtc gcg teg att cca ggc cag 
Leu Thr Leu Ser Ala Ser Phe Gin Ala Val Ala Ser lie Pro Gly Gin 
20 25 30 



96 



gtt gee gat cag gee cct etc ccc agt ctg get cca atg ctg gaa aaa 
Val Ala Asp Gin Ala Pro Leu Pro Ser Leu Ala Pro Met Leu Glu Lys 
35 40 45 



144 



gtg ctt ccg gca gtg gtg age gta egg gtg gaa gga acg gec agt cag 
Val Leu Pro Ala Val Val Ser Val Arg Val Glu Gly Thr Ala Ser Gin 
50 55 60 



192 



gga cag aaa ate ccg gaa gaa ttc aaa aag ttt ttt ggt gat gat tta 
Gly Gin Lys He Pro Glu Glu Phe Lys Lys Phe Phe Gly Asp Asp Leu 
65 70 75 80 



240 



ccg gat caa cct gca caa ccc ttc gaa ggt tta ggc tec ggt gtc ate 
Pro Asp Gin Pro Ala Gin Pro Phe Glu Gly Leu Gly Ser Gly Val He 
85 90 95 



288 



ate aac gee agt aaa ggc tat gtg ctg acc aac aac cat gtg att aat 
He Asn Ala Ser Lys Gly Tyr Val Leu Thr Asn Asn His Val He Asn 
100 * 105 110 



336 



•cag gca cag aaa ate agt att cag etc aat gat ggg cgc gag ttt gat 
Gin Ala Gin Lys He Ser He Gin Leu Asn Asp Gly Arg Glu Phe Asp 
115 120 125 



384 



gca aaa ctg att ggt age gat gac cag age gat ate gec ctg tta caa 
Ala Lys Leu He Gly Ser Asp Asp Gin Ser Asp He Ala Leu Leu Gin 
130 135 140 



432 



att caa aac ccg age aaa tta acg caa ate get att gec gac tec gat 
He Gin Asn Pro Ser Lys Leu Thr Gin He Ala He Ala Asp Ser Asp 



480 
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145 150 155 160 

aaa ttg cgc gtc ggt gat ttt gcc gta gcg gtc ggt aac cca ttt ggc 528 

Lys Leu Arg Val Gly Asp Phe Ala Val Ala Val Gly Asn Pro Phe Gly 

165 170 175 

ctt ggg caa acc gcc acc tct ggc att gtt tec gca tta ggc cgc age 576 

Leu Gly Gin Thr Ala Thr Ser Gly lie Val Ser Ala Leu Gly Arg Ser 
180 185 190 



ggg ttg aat ctt gaa ggt ctg gaa aac ttt ate cag aca gat get tec 
Gly Leu Asn Leu Glu Gly Leu Glu Asn Phe lie Gin Thr Asp Ala Ser 
195 200 205 



260 265 270 

ggc acc gag atg agt gcc gat ate gcc aaa gcc ttc aac ctt gac gtg 

Gly Thr Glu Met Ser Ala Asp He Ala Lys Ala Phe Asn Leu Asp Val 

275 280 285 



aaa gcg ggc gtc aaa gcg ggc gat att att acc age etc aac ggc aaa 
Lys Ala Gly Val Lys Ala Gly Asp He He Thr Ser Leu Asn Gly Lys 
305 310 315 320 



624 



att aac cgc ggt aac tec ggc ggt gca eta tta aac ctt aac ggt gag 672 
He Asn Arg Gly Asn Ser Gly Gly Ala Leu Leu Asn Leu Asn Gly Glu 
210 215 220 

tta att ggc ate aac act gca ate ctt gcg cct ggc ggc ggg age gtc 720 
Leu He Gly He Asn Thr Ala He Leu Ala Pro Gly Gly Gly Ser Val 
225 230 235 240 

ggg att gga ttt gcc ate ccc agt aat atg gcg cga aca ctg gcg cag 7 68 

Gly He Gly Phe Ala He Pro Ser Asn Met Ala Arg Thr Leu Ala Gin 
245 250 255 

cag ctt ate gac ttt ggt gaa ate aaa cgc ggt ttg tta ggc ate aaa 816 
Gin Leu He Asp Phe Gly Glu He Lys Arg Gly Leu Leu Gly He Lys 



864 



cag cgt ggc gcg ttt gtc age gaa gtg ttg cca ggt tct ggc teg gca 912 
Gin Arg Gly Ala Phe Val Ser Glu Val Leu Pro Gly Ser Gly Ser Ala 
290 295 300 



960 



ccg ctg aat age ttt get gag ttg cgc tct cgt ate gcg acc acc gag 1008 
Pro Leu Asn Ser Phe Ala Glu Leu Arg Ser Arg He Ala Thr Thr Glu 
325 330 335 

ccg ggc acg aaa gtg aag ctt ggc ctg ctg cgt aac ggc aaa cca ctg 1056 
Pro Gly Thr Lys Val Lys Leu Gly Leu Leu Arg Asn Gly Lys Pro Leu 
340 345 350 



gaa gta gaa gtg acg etc. gat acc age acc tct teg teg gcc age get 
Glu Val Glu Val Thr Leu Asp Thr Ser Thr Ser Ser. Ser Ala Ser Ala 



1104 

Ala Ser Ala 

355 360 . 365 



gaa atg ate acg cca gcg ctg gaa ggt gca acg ttg age gat ggt cag 1152 
Glu Met He Thr Pro Ala Leu Glu Gly Ala Thr Leu Ser Asp Gly Gin 
370 375 380 

eta aaa gat ggc ggc aaa ggt att aaa ate gat gaa gtt gtc aaa gga 1200 
Leu Lys Asp Gly Gly Lys Gly He Lys He Asp Glu Val Val Lys Gly • 
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385 390 395 400 

age cca get get cag get ggc ttg caa aaa gac gat gtg ate att ggc 124 8 
Ser Pro Ala Ala Gin Ala Gly Leu Gin Lys Asp Asp Val lie lie Gly 
405 410 415 

gtc aac cgc gat egg gtg aac teg att get gaa atg cgt aaa gtg ctg 1296 
Val Asn Arg Asp Arg Val Asn Ser lie Ala Glu Met Arg Lys Val Leu 
420 425 430 

gcg gca aaa ccg gee ate ate gee ctg caa att gta cgc ggc aat gaa 1344 
Ala Ala Lys Pro Ala lie lie Ala Leu Gin lie Val Arg Gly Asn Glu 
435 440 445 

age ate tat ctg ctg atg cgt taa 1368 
Ser lie Tyr Leu Leu Met Arg * 
450 455 



<210> 246 
<211> 1068 
<212> DNA 

<213> Escherichia coli 



<220> 
<221> CDS 
<222> (1). 



(1068) 



<400> 246 

atg ttt gtg aag etc tta cgt tec gtt gcg att gga tta att gtc ggc 

Met Phe Val Lys Leu Leu Arg Ser Val Ala lie Gly Leu lie Val Gly 
1 5 10 15 



48 



get att ctg ctg gtt gee atg cct teg ctg cgc age ctt aac ccg ctt 
Ala He Leu Leu Val Ala Met Pro Ser Leu Arg Ser Leu Asn Pro Leu 
20 25 30 



96 



tec act ccg caa ttt gac agt ace gat gag acg cct gee age tat aat 
Ser Thr Pro Gin Phe Asp Ser Thr Asp Glu Thr Pro Ala Ser Tyr Asn 
35 40 45 



144 



ctg gcg gtt cgc cgc gee gcg cca gcg gtg gtt aac gtt tac aac cgt 
Leu Ala Val Arg Arg Ala Ala Pro Ala Val Val Asn Val Tyr Asn Arg 
50 55 60 



192 



ggt ttg aac ace aac tct cac aac cag ctt gag ate cgc ace ctg gga 
Gly Leu Asn Thr Asn Ser His Asn Gin Leu Glu He Arg Thr Leu Gly 
65 70 75 80 



240 



tec ggt gta ate atg gat caa cgc ggt tat ate ate ace aat aaa cac 
Ser Gly Val He Met Asp Gin Arg Gly Tyr He He Thr Asn Lys His 
85 90 95 



288 



gtc ate aac gac gee gat cag ate ate gtc gee tta cag gat gga cgt 
Val He Asn Asp Ala Asp Gin He He Val Ala Leu Gin Asp Gly Arg 
100 105 110 



336 



gta ttt gaa gca ttg ctg gtg gga tct gac tct eta acc gat ctg gcg 
Val Phe Glu Ala Leu Leu Val Gly Ser Asp Ser Leu Thr Asp Leu Ala 
115 120 125 



384 
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gta ctt aaa att aat gcc act ggc ggt tta cct acc att cca att aat 
Val Leu Lys He Asn Ala Thr Gly Gly Leu Pro Thr He Pro He Asn 
130 135 140 

gca cgt cgc gta ccg cac att ggc gac gta gta ctg gcg ate ggt aac 
Ala Arg Arg Val Pro His He Gly Asp Val Val Leu Ala He Gly Asn 
145 ~ " 150 155 160 

ccg tac aac etc ggg cag acc att acc cag ggg att att agt gcc acg 
Pro Tyr Asn Leu Gly Gin Thr He Thr Gin Gly He He Ser Ala Thr 
165 170 175 



gat get tec att aac cac ggt aac tct ggc ggc gcg ctg gtg aac teg 
Asp Ala Ser He Asn His Gly Asn Ser Gly Gly Ala Leu Val Asn Ser 
195 200 205 

ctg ggc gaa ctg atg ggc att aat acg ctg teg ttt gat aag agt aac 
Leu Gly Glu Leu Met Gly He Asn Thr Leu Ser Phe Asp Lys Ser Asn 
210 215 220 

gat ggc gaa acg ccg gaa ggt ate ggc ttt gcg att cct ttc cag tta 
Asp Gly Glu Thr Pro Glu Gly He Gly Phe Ala He Pro Phe Gin Leu 
225 230 235 240 

gca acc aaa att atg gat aag ctg ate cgc gat ggt cgc gtg ate cgc 
Ala Thr Lys He Met Asp Lys Leu He Arg Asp Gly Arg Val He Arg 
245 250 255 

ggc tac att ggt ate ggc gga cgt gag ate gca cca ctg cac gcg cag 
Gly Tyr He Gly lie Gly Gly Arg Glu He Ala Pro Leu His Ala Gin 
260 265 270 

ggc ggt ggt ata gat caa ctg caa ggg ate gtg gtt aat gaa gtg tea 
Gly Gly Gly He Asp Gin Leu Gin Gly He Val Val Asn Glu Val Ser 
275 280 285 

cct gac ggc ccg gcg gcg aat gcg ggt att cag gtc aac gat ctg att 
Pro Asp Gly Pro Ala Ala Asn Ala Gly He Gin Val Asn Asp Leu He 
290 295 300 

att teg gtg gat aac aaa ccg gcc ate tct get ctg gag acg atg gat 
He Ser Val Asp Asn Lys Pro Ala He Ser Ala Leu Glu Thr Met Asp 
305 310 315 320 

cag gtg gcg gaa att cgc cct ggt teg gtg ate cct gta gta gtg atg 
Gin Val Ala Glu He Arg Pro Gly Ser Val He Pro Val Val Val Met 
325 330 335 

cgt gat gat aag cag tta acg ctg cag gtc acc att cag gaa tat ccg 
Arg Asp Asp Lys Gin Leu Thr Leu Gin Val. Thr He Gin Glu Tyr Pro 
340 345 350 



432 



480 



528 



ggt cga ate ggt ctg aac ccg acc ggg egg caa aac ttc etc caa acc 57 6 

Gly Arg He Gly Leu Asn Pro Thr Gly Arg Gin Asn Phe Leu Gin Thr 
180 185 190 



624 



672 



720 



768 



816 



864 



912 



960 



1008 



1056 



gca acc aat taa 1068 
Ala Thr Asn * 
355 
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<210> 247 
<211> 357 
<212> DNA 

<213> Escherichia coli 

<220> 

<221> CDS 

<222> (1) . , . (357) 

<400> 247 

gtg gcc cgt ata gca ggc att aac att cct gat cat aag cat gcc gta 4 8 

Met Ala Arg lie Ala Gly lie Asn lie Pro Asp His Lys His Ala Val 

15 10 15 

ate gca tta act teg att tat ggc gtc ggc aag acc cgt tct aaa gcc 96 
lie Ala Leu Thr Ser He Tyr Gly Val Gly Lys Thr Arg Ser Lys Ala 
20 25 30 

ate ctg get gca gcg ggt ate get gaa gat gtt aag ate agt gag ctg 14 4 

He Leu Ala Ala Ala Gly He Ala Glu Asp Val Lys He Ser Glu Leu 
35 40 45 

tct gaa gga caa ate gac acg ctg cgt gac gaa gtt gcc aaa ttt gtc 192 
Ser Glu Gly Gin He Asp Thr Leu Arg Asp Glu Val Ala Lys Phe Val 
50 ' 55 60 

gtt gaa ggt gat ctg cgc cgt gaa ate age atg age ate aag cgc ctg 24 0 

Val Glu Gly Asp Leu Arg Arg Glu lie Ser Met Ser He Lys Arg Leu 
65 70 75 80 

atg gat ctt ggt tgc tat cgc ggt ttg cgt cat cgt cgt ggt etc ccg 288 
Met Asp Leu Gly Cys Tyr Arg Gly Leu Arg His Arg Arg Gly Leu Pro 
85 90 95 

gtt cgc ggt cag cgt acc aag acc aac gca cgt acc cgt aag ggt ccg 336 
Val Arg Gly Gin Arg Thr Lys Thr Asn Ala Arg Thr Arg Lys Gly Pro 
100 105 110 

cgc aaa ccg ate aag aaa taa 357 
Arg Lys Pro He Lys Lys * 
115 



<210> 248 
<211> 390 
<212> DNA 

<213> Escherichia coli 

<220> 

<221> CDS 

<222> (1) . . . (390) 

<400> 248 

atg gca aag gca cca att cgt gca cgt aaa cgt gta aga aaa caa gtc 48 

Met Ala Lys Ala Pro He Arg Ala Arg Lys Arg Val Arg Lys Gin Val 
15 10 15 

tct gac ggc gtg get cat ate cat get tct ttc aac aac acc ate gtg 96 
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Ser Asp Gly Val Ala His lie His Ala Ser Phe Asn Asn Thr lie Val 
20 25 30 

act ate act gat cgt cag ggt aac gcg ttg ggt tgg gca aca gec ggt 144 
Thr He Thr Asp Arg Gin Gly Asn Ala Leu Gly Trp Ala Thr Ala Gly 
35 40 45 

ggt tec ggt ttc cgt ggt tct cgc aaa tec act ccg ttt gca get cag 192 
Gly Ser Gly Phe Arg Gly Ser Arg Lys Ser Thr Pro Phe Ala Ala Gin 
50 " 55 60 

gtt gca gca gag cgt tgc get gac gec gtg aaa gaa tac ggc ate aag 240 
Val Ala Ala Glu Arg Cys Ala Asp Ala Val Lys Glu Tyr Gly He Lys 
65 70 75 80 

aat ctg gaa gtt atg gtt aaa ggt ccg ggt cca ggc cgc gaa tct act 288 
Asn Leu Glu Val Met Val Lys Gly Pro Gly Pro Gly Arg Glu Ser Thr 
85 90 95 

att cgt get ctg aac gec gca ggt' ttc cgc ate act aac att act gat 336 
He Arg Ala Leu Asn Ala Ala Gly Phe Arg He Thr Asn He Thr Asp 
100 105 HO 

gtg act ccg ate cct cat aac ggt tgt cgt ccg ccg aaa aaa cgt cgc 384 
Val Thr Pro He Pro His Asn Gly Cys Arg Pro Pro Lys Lys Arg Arg 
115 120 125 



gta taa 
Val * 



390 



<210> 249 
<211> 621 
<212> DNA 

<213> Escherichia coli 

<220> 

<221> CDS 

<222> (1) . . . (621) 

<400> 249 

atg gca aga tat ttg ggt cct aag etc aag ctg age cgt cgt gag ggc 4 8 

Met Ala Arg Tyr Leu Gly Pro Lys Leu Lys Leu Ser Arg Arg Glu Gly 

15 10 15 

ace gac tta ttc ctt aag tct ggc gtt cgc gcg ate gat ace aag tgt 96 
Thr Asp Leu Phe Leu Lys Ser Gly Val Arg Ala He Asp Thr Lys Cys 
20 25 30 

aaa att gaa caa get cct ggc cag cac ggt gcg cgt aaa ccg cgt ctg 14 4 

Lys He Glu Gin Ala Pro Gly Gin His Gly Ala Arg Lys Pro Arg Leu 
35 40 45 

tct gac tat ggt gtg cag ttg cgt gaa aag caa aaa gtt cgc cgt ate 192 
Ser Asp Tyr Gly Val Gin Leu Arg Glu Lys Gin Lys Val Arg Arg He 
50 55 60 

tat ggt gtg ctg gag cgt cag ttc cgt aac tac tac aaa gaa gca gca 24 0 

Tyr Gly Val Leu Glu Arg Gin Phe Arg Asn Tyr Tyr Lys Glu Ala Ala 
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65 70 75 80 

cgt ctg aaa ggc aac acc ggt gaa aac ctg ttg get ctg ctg gaa ggt 288 

Arg Leu Lys Gly Asn Thr Gly Glu Asn Leu Leu Ala Leu Leu Glu Gly 
85 90 95 



cgt ctg gac aac gtt gta tac cgt atg ggc ttc ggt gec act cgt gca 336 

Arg Leu Asp Asn Val Val Tyr Arg Met Gly Phe Gly Ala Thr Arg Ala 

100 105 110 

gaa gca cgt cag ctg gtt age cat aaa gca att atg gta aac ggt cgt 384 

Glu Ala Arg Gin Leu Val Ser His Lys Ala lie Met Val Asn Gly Arg 

115 120 125 

gtt gtt aac ate get tct tat cag gtt agt ccg aat gac gtt gta age 432 

Val Val Asn lie Ala Ser Tyr Gin Val Ser Pro Asn Asp Val Val Ser 

130 135 140 

att cgt gag aaa gcg aag aag cag tct cgc gtg aaa gee get ctg gag 480 

lie Arg Glu Lys Ala Lys Lys Gin Ser Arg Val Lys Ala Ala Leu Glu 

145 150 155 160 

ctg get gag cag cgt gaa aag cca acc tgg ctg gaa gtt gat get ggc 528 

Leu Ala Glu Gin Arg Glu Lys Pro Thr Trp Leu Glu Val Asp Ala Gly 

165 170 175 

aag atg gaa ggt acg ttt aag cgt aag ccg gag cgt tct gat ctg tct 576 

Lys Met Glu Gly Thr Phe Lys Arg Lys Pro Glu Arg Ser Asp Leu Ser 

180 185 190 

gcg gac att aac gaa cac ctg ate gtc gag ctt tac tec aag taa 621 

Ala Asp lie Asn Glu His Leu lie Val Glu Leu Tyr Ser Lys * 

195 200 205 



<210> 250 
<211> 990 
<212> DNA 

<213> Escherichia coli 

<220> 

<221> CDS 

<222> (1) . . . (990) 

<400> 250 

atg cag ggt tct gtg aca gag ttt eta aaa ccg cgc ctg gtt gat ate 48 

Met Gin Gly Ser Val Thr Glu Phe Leu Lys Pro Arg Leu Val Asp lie 

15 10 15 

gag caa gtg agt teg acg cac gee aag gtg acc ctt gag cct tta gag 96 
Glu Gin Val Ser Ser Thr His Ala Lys Val Thr Leu Glu Pro Leu Glu 
20 25 30 

cgt ggc ttt ggc cat act ctg ggt aac gca ctg cgc cgt att ctg etc 144 
Arg Gly Phe Gly His Thr Leu Gly Asn Ala Leu Arg Arg lie Leu Leu 
35 40 45 

tea teg atg ccg ggt tgc gcg gtg acc gag gtt gag att gat ggt gta 192 
Ser Ser Met Pro Gly Cys Ala Val Thr Glu Val Glu He Asp Gly Val 
50 . 55 60 
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eta cat gag tac age acc aaa gaa ggc gtt cag gaa gat ate ctg gaa 240 
Leu His Glu Tyr Ser Thr Lys Glu Gly Val Gin Glu Asp lie Leu Glu 
65 70 75 80 

ate ctg etc aac ctg aaa ggg ctg gcg gtg aga gtt cag ggc aaa gat 288 
lie Leu Leu Asn Leu Lys Gly Leu Ala Val Arg Val Gin Gly Lys Asp 
85 90 95 

gaa gtt att ctt acc ttg aat aaa tct ggc att ggc cct gtg act gca 336 
Glu Val He Leu Thr Leu Asn Lys Ser Gly He Gly Pro Val Thr Ala 
100 105 110 



gec gat ate acc cac gac ggt gat gtc gaa ate gtc aag ccg cag cac 
Ala Asp He Thr His Asp Gly Asp Val Glu He Val Lys Pro Gin His 
115 120 125 



gaa cag cgt acc gac ctg gac aag ctg gtc ate gaa atg gaa acc aac 
Glu Gin Arg Thr Asp Leu Asp Lys Leu Val He Glu Met Glu Thr Asn 
195 200 205 



384 



gtg ate tgc cac ctg acc gat gag aac gcg tct att age atg cgt ate 4 32 

Val He Cys His Leu Thr Asp Glu Asn Ala Ser He Ser Met Arg He 
130 135 140 

aaa gtt cag cgc ggt cgt ggt tat gtg ccg get tct acc cga att cat 4 80 

Lys Val Gin Arg Gly Arg Gly Tyr Val Pro Ala Ser Thr Arg lie His 
145 150 " 155 160 

teg gaa gaa gat gag cgc cca ate ggc cgt ctg ctg gtc gac gca tgc 528 
Ser Glu Glu Asp Glu Arg Pro He Gly Arg Leu Leu Val Asp Ala Cys 
165 ' 170 175 

tac age cct gtg gag cgt att gee tac aat gtt gaa gca gcg cgt gta 576 
Tyr Ser Pro Val Glu Arg He Ala Tyr Asn Val Glu Ala Ala Arg Val 
180 185 190 



624 



ggc aca ate gat cct gaa gag gcg att cgt cgt gcg gca acc att ctg 672 
Gly Thr He Asp Pro Glu Glu Ala He Arg Arg Ala Ala Thr He Leu 
210 215 220 

get gaa caa ctg gaa get ttc gtt gac tta cgt gat gta cgt cag cct 720 
Ala Glu Gin Leu Glu Ala Phe Val Asp Leu Arg Asp Val Arg Gin Pro 
225 230 235 240 

gaa gtg aaa gaa gag aaa cca gag ttc gat ccg ate ctg ctg cgc cct 768 
Glu Val Lys Glu Glu Lys Pro Glu Phe Asp Pro He Leu Leu Arg Pro 
245 250 255 

gtt gac gat ctg gaa ttg act gtc cgc tct get aac tgc ctt aaa gca 816 
Val Asp Asp Leu Glu Leu Thr Val Arg Ser Ala Asn Cys Leu Lys Ala 
260 265 270 

gaa get ate cac tat ate ggt gat ctg gta cag cgt acc gag gtt gag 8 64 

Glu Ala He His Tyr He Gly Asp Leu Val Gin Arg Thr Glu Val Glu 
275 280 285 

etc ctt aaa acg cct aac ctt ggt aaa aaa tct ctt act gag att aaa 912 
Leu Leu Lys Thr Pro Asn Leu Gly Lys Lys Ser Leu Thr Glu He Lys 
290 295 300 
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gac gtg ctg get tec cgt gga ctg tct ctg ggc atg cgc ctg gaa aac 960 
Asp Val Leu Ala Ser Arg Gly Leu Ser Leu Gly Met Arg Leu Glu Asn 
305 310 315 320 

tgg cca ccg gca age ate get gac gag taa 990 
Trp Pro Pro Ala Ser lie Ala Asp Glu * 
325 



<210> 251 
<211> 384 
<212> DNA 

<213> Escherichia coli 

<220> 

<221> CDS 

<222> (1) . . . (384) 

<400> 251 

atg cgc cat cgt aag agt ggt cgt caa ctg aac cgc aac age age cat 48 

Met Arg His Arg Lys Ser Gly Arg Gin Leu Asn Arg Asn Ser Ser His 
15 10 15 

cgc cag get atg ttc cgc aat atg gca ggt tea ctg gtt cgt cat gaa 96 
Arg Gin Ala Met Phe Arg Asn Met Ala Gly Ser Leu Val Arg His Glu 
20 25 ~ 30 

ate ate aag acg act ctg cct aaa gcg aaa gag ctg cgc cgc gta gtt 144 
lie lie Lys Thr Thr Leu Pro Lys Ala Lys Glu Leu Arg Arg Val Val 
35 40 45 

gag ccg ctg att act ctt gee aag act gat age gtt get aat cgt cgt 192 
Glu Pro Leu lie Thr Leu Ala Lys Thr Asp Ser Val Ala Asn Arg Arg 
50 55 60 

ctg gca ttc gec cgt act cgt gat aac gag ate gtg gca aaa ctg ttt 240 
Leu Ala Phe Ala Arg Thr Arg Asp Asn Glu lie Val Ala Lys Leu Phe 
65 70 75 80 

aac gaa ctg ggc ccg cgt ttc gcg age cgt gee ggt ggt tac act cgt 288 
Asn Glu Leu Gly Pro Arg Phe Ala Ser Arg Ala Gly Gly Tyr Thr Arg 
85 90 95 

att ctg aag tgt ggc ttc cgt gca ggc gac aac gcg ccg atg get tac 336 
lie Leu Lys Cys Gly Phe Arg Ala Gly Asp Asn Ala Pro Met Ala Tyr 
100 105 110 

ate gag ctg gtt gat cgt tea gag aaa gca gaa get get gca gag taa 384 
lie Glu Leu Val Asp Arg Ser Glu Lys Ala Glu Ala Ala Ala Glu * 
115 120 125 



<210> 252 
<211> 141 
<212> DNA 

<213> Escherichia coli 

<220> 
<221> CDS 
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<222> (1) . . . (141) 
<400> 252 

atg aaa cgc act ttt caa ccg tct gta ctg aag cgc aac cgt tct cac 48 
Met Lys Arg Thr Phe Gin Pro Ser Val Leu Lys Arg Asn Arg Ser His 
1 5 10 15 

ggc ttc cgt get cgt atg get act aaa aat ggt cgt cag gtt ctg gca 96 
Gly Phe Arg Ala Arg Met Ala Thr Lys Asn Gly Arg Gin Val Leu Ala 
20 " 25 30 

cgt cgt cgt get aaa ggc cgc get cgt ctg ace gtt tct aag taa 141 
Arg Arg Arg Ala Lys Gly Arg Ala Arg Leu Thr Val Ser Lys * 
35 40 45 



<210> 253 
<211> 360 
<212> DNA 

<213> Escherichia coli 

<220> ... . 

<221> CDS 

<222> (1) . . . (360) 

<400> 253 

gtg gtt aag etc gca ttt ccc agg gag tta cgc ttg tta act ccc agt 48 
Met Val Lys Leu Ala Phe Pro Arg Glu Leu Arg Leu Leu Thr Pro Ser 
1 5 10 15 

caa ttc aca ttc gtc ttc cag cag cca caa egg get ggc acg ccg caa 96 
Gin Phe Thr Phe Val Phe Gin Gin Pro Gin Arg Ala Gly Thr Pro Gin 
20 25 30 

att ace att etc ggc cgc ctg aat teg ctg ggg cat ccc cgt ate ggt 144 
lie Thr lie Leu Gly Arg Leu Asn Ser Leu Gly His Pro Arg lie Gly 
35 40 45 

ctt aca gtc gee aag aaa aac gtt cga cgc gee cat gaa cgc aat egg 192 
Leu Thr Val Ala Lys Lys Asn Val Arg Arg Ala His Glu Arg Asn Arg 
50 55 60 

att aaa cgt ctg acg cgt gaa age ttc cgt ctg cgc caa cat gaa etc 240 
lie Lys Arg Leu Thr Arg Glu Ser Phe Arg Leu Arg Gin His Glu Leu 
65 ~ 70 75 80 

ccg get atg gat ttc gtg gtg gtg gcg aaa aaa ggg gtt gee gac etc 288 
Pro Ala Met Asp Phe Val Val Val Ala Lys Lys Gly Val Ala Asp Leu 
85 90 95 

gat aac cgt get etc teg gaa gcg ttg gaa aaa tta tgg cgc cgc cac 336 
Asp Asn Arg Ala Leu Ser Glu Ala Leu Glu Lys Leu Trp Arg Arg His 
100 105 110 



tgt cgc ctg get cgc ggg tec tga 
Cys Arg Leu Ala Arg Gly Ser * 
115 



360 



<210> 254 
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<211> 1776 
<212> DNA 

<213> Escherichia coli 

<220> 

<221> CDS 

<222> (1) . - . (1776) 

<400> 254 

gtg ate gaa aaa ttg cgt aat ate gec ate ate gcg cac gta gac cat 4 8 

Met lie Glu Lys Leu Arg Asn He Ala He He Ala His Val Asp His 

15 10 15 

ggt aaa acc acc ctg gta gac aag ctg etc caa caa tec ggt acg ttc 96 
Gly Lys Thr Thr Leu Val Asp Lys Leu Leu Gin Gin Ser Gly Thr Phe 
* 20 2 5 30 

gac tct cgt gec gaa acc caa gag cgc gtg atg gac tec aac gat ttg 144 
Asp Ser Arg Ala Glu Thr Gin Glu Arg Val Met Asp Ser Asn Asp Leu 
35 40 ~ 45 

gag aaa gag cgt ggg att acc ate etc gcg aaa aac acc get ate aaa 192 
Glu Lys Glu Arg Gly He Thr He Leu Ala Lys Asn Thr Ala He Lys 
50 55 60 

tgg aat gat tac cgt ate aac ate gtt gat acc ccg ggg cac gee gac 240 
Trp Asn Asp Tyr Arg He Asn He Val Asp Thr Pro Gly His Ala Asp 
65 70 75 80 

ttc ggt ggt gaa gtt gaa cgt gta atg tec atg gta gac tea gtg ctg 288 
Phe Gly Gly Glu Val Glu Arg Val Met Ser Met Val Asp Ser Val Leu 
85 90 95 

ctg gtg gtt gac gca ttt gac ggc ccg atg ccg caa acg cgc ttc gta 336 
Leu Val Val Asp Ala Phe Asp Gly Pro Met Pro Gin Thr Arg Phe Val 
100 105 110 

acc aaa aaa gcg ttt get tac ggc ctg aag ccg att gtt gtt ate aac 384 
Thr Lys Lys Ala Phe Ala Tyr Gly Leu Lys Pro He Val Val He Asn 
115 120 125 

aaa gtt gac cgc cct ggc gcg cgt cct gat tgg gtt gtg gat cag gta 432 
Lys Val Asp Arg Pro Gly Ala Arg Pro Asp Trp Val Val Asp Gin Val 
130 135 140 

ttc gat ctg ttc gtt aac etc gac gcg acc gac gag cag ctg gac ttc 480 
Phe Asp Leu Phe Val Asn Leu Asp Ala Thr Asp Glu Gin Leu Asp Phe 
145 " 150 155 160 

ccg ate gtt tac get tct gcg ctg aac ggt ate gcg ggt ctg gac cac 528 
Pro He Val Tyr Ala Ser Ala Leu Asn Gly He Ala Gly Leu Asp His 
165 170 175 

gaa gat atg gcg gaa gac atg acc ccg ctg tac cag gcg att gtt gac 57 6 

Glu Asp Met Ala Glu Asp Met Thr Pro Leu Tyr Gin Ala He Val Asp 
180 185 190 

cac gtt cct gcg ccg gac gtt gac ctt gac ggt ccg ttc cag atg cag 624 
His Val Pro Ala Pro Asp Val Asp Leu Asp Gly Pro Phe Gin Met Gin 
195 200 205 
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att tct cag etc gat tac aac age tat gtt ggc gtt ate ggc att ggc 672 
lie Ser Gin Leu Asp Tyr Asn Ser Tyr Val Gly Val He Gly He Gly 
210 215 220 

cgc ate aag cgc ggt aaa gtg aag ccg aac cag cag gtc act ate ate 720 
Arg He Lys Arg Gly Lys Val Lys Pro Asn Gin Gin Val Thr He He 
225 " 230 * 235 240 

gat age gaa ggc aaa ace cgc aac gcg aaa gtc ggt aaa gtg ctg ggc 7 68 

Asp Ser Glu Gly Lys Thr Arg Asn Ala Lys Val Gly Lys Val Leu Gly 
245 250 255 

cac etc ggt ctg gaa cgt ate gaa acc gat ctg gcg gaa get ggc gat 816 
His Leu Gly Leu Glu Arg He Glu Thr Asp Leu Ala Glu Ala Gly Asp 
260 265 270 

ate gtt gcg ate acg ggc ctt ggc gaa ctg aac att tct gac acc gtt 864 
He Val Ala He Thr Gly Leu Gly Glu Leu Asn He Ser Asp Thr Val 
275 280 285 

tgc gac acg caa aac gtt gaa gcg ctg ccg gca etc tec gtt gat gag 912 
Cys Asp Thr Gin Asn Val Glu Ala Leu Pro Ala Leu Ser Val Asp Glu 
290 295 300 

ccg acc gtt tct atg ttc ttc tgc gtt aac acc teg ccg ttc tgc ggt 960 
Pro Thr Val Ser Met Phe Phe Cys Val Asn Thr Ser Pro Phe Cys Gly 
305 310 315 320 

aaa gaa ggt aag ttc gta acg tct cgt cag ate ctg gat cgt ctg aac 1008 
Lys Glu Gly Lys Phe Val Thr Ser Arg Gin He Leu Asp Arg Leu Asn 
325 330 335 

aaa gaa ctg gta cac aac gtt gcg ctg cgc gta gaa gaa acc gaa gac 1056 
Lys Glu Leu Val His Asn Val Ala Leu Arg Val Glu Glu Thr Glu Asp 
340 345 350 

gee gat gcg ttc cgc gtt tct ggt cgt ggc gaa ctg cac ctg tct gtt 1104 
Ala Asp Ala Phe Arg Val Ser Gly Arg Gly Glu Leu His Leu Ser Val 
355 360 365 

ctg ate gaa aac atg cgt cgt gaa ggt ttc gaa ctg gcg gta tec cgt 1152 
Leu He Glu Asn Met Arg Arg Glu Gly Phe Glu Leu Ala Val Ser Arg 
370 375 380 

ccg aaa gtt ate ttc cgt gaa ate gac ggt cgt aaa caa gag ccg tat 1200 
Pro Lys Val He Phe Arg Glu He Asp Gly Arg Lys Gin Glu Pro Tyr 
385 390 395 400 

gaa aac gtg act ctg gac gtt gaa gaa cag cat cag ggt tct gta atg 1248 
Glu Asn Val Thr Leu Asp Val Glu Glu Gin His Gin Gly Ser Val Met 
405 410 415 

cag gcg ctg ggc gaa cgt aaa ggc gac ctg aaa aac atg aat cca gac 1296 
Gin Ala Leu Gly Glu Arg Lys Gly Asp Leu Lys Asn Met Asn Pro Asp 
420 . 425 430 

ggt aaa ggc cgc gta cgt etc gac tac gtg ate cca age cgt ggt ctg 1344 
Gly Lys Gly Arg Val Arg Leu Asp Tyr Val He Pro Ser Arg Gly Leu 
435 440 445 
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att ggc ttc cgt tct gag ttc atg acc atg act tec ggt act ggt ctg 1392 
lie Gly Phe Arg Ser Glu Phe Met Thr Met Thr Ser Gly Thr Gly Leu 
450 455 460 

ctg tac tec acc ttc age cac tac gac gac gta cgt ccg ggt gaa gtg 1440 
Leu Tyr Ser Thr Phe Ser His Tyr Asp Asp Val Arg Pro Gly Glu Val 
465 470 475 480 

ggt cag cgt cag aac ggc gta ctg ate tct aac ggt cag ggt aaa gcg 1488 
Gly Gin Arg Gin Asn Gly Val Leu lie Ser Asn Gly Gin Gly Lys Ala 
485 490 495 

gtc gcg ttc gcg ctg ttc ggt ctg cag gat cgc ggt aag ctg ttc etc 1536 
Val Ala Phe Ala Leu Phe Gly Leu Gin Asp Arg Gly Lys Leu Phe Leu 
500 505 510 

ggt cac ggt gca gaa gtt tac gaa ggt cag att ate ggt att cat age 1584 
Gly His Gly Ala Glu Val Tyr Glu Gly Gin He He Gly He His Ser 
515 520 525 

cgc tct aac gac ctg act gta aac tgc ctg acc ggt aag aaa ctg acc 1632 
Arg Ser Asn Asp Leu Thr Val Asn Cys Leu Thr Gly Lys Lys Leu Thr 
530 535 540 

aac atg cgt get tec ggt act gac gaa gcc gtt gtt ctg gtt ccg cct 1680 
Asn Met Arg Ala Ser Gly Thr Asp Glu Ala Val Val Leu Val Pro Pro 
545 550 555 560 

ate cgc atg act ctg gaa caa get ctg gag ttc ate gat gat gac gaa 1728 
lie Arg Met Thr Leu Glu Gin Ala Leu Glu Phe lie Asp Asp Asp Glu 
565 570 575 

ctg gta gaa gtg act ccg acc tct ate cgt att cgt aaa cgt cac tga 1776 
Leu Val Glu Val Thr Pro Thr Ser He Arg He Arg Lys Arg His * 
580 585 590 



<210> 255 
<211> 2271 
<212> DNA 

<213> Escherichia coli 

<220> 

<221> CDS 

<222> (1) . . . (2271) 

<400> 255 

atg atg aaa gta tta att gtt gaa age gag ttt etc cat caa gac acc 4 8 

Met Met Lys Val Leu He Val Glu Ser Glu Phe Leu His Gin Asp Thr 
1 5 10 15 

tgg gtc ggt aac gcc gtt gag cgt ctg gca gat get tta age cag caa 96 
Trp Val Gly Asn Ala Val Glu Arg Leu Ala Asp Ala Leu Ser Gin Gin 
20 25 30 

aat gtt acc gtg att aaa tec acc tec ttt gat gat ggt ttt gcc att 14 4 

Asn Val Thr Val He Lys Ser Thr Ser Phe Asp Asp Gly Phe Ala He 
35 40 45 
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etc tct tea aac gaa gec att gac tgc ctg atg ttc age tat caa atg 192 
Leu Ser Ser Asn Glu Ala He Asp Cys Leu Met Phe Ser Tyr Gin Met 
50 55 60 

gaa cat ccg gac gaa cat caa aac gtc aga caa ttg ate ggt aag ctt 240 
Glu His Pro Asp Glu His Gin Asn Val Arg Gin Leu lie Gly Lys Leu 
65 70 75 80 

cat gag cgc caa caa aac gtg ccg gtc ttc ctg ttg ggc gat egg gaa 288 
His Glu Arg Gin Gin Asn Val Pro Val Phe Leu Leu Gly Asp Arg Glu 
85 90 95 

aaa gee etc gec gca atg gat cgc gac ctg ctg gag ctt gtc gat gaa 336 
Lys Ala Leu Ala Ala Met Asp Arg Asp Leu Leu Glu Leu Val Asp Glu 
100 105 110 

ttc gee tgg att ctg gaa gat ace gee gac ttt ate gec gga cgc gee 384 
Phe Ala Trp He Leu Glu Asp Thr Ala Asp Phe He Ala Gly Arg Ala 
115 120 125 

gtt gec gcg atg acc cgc tac cgc cag cag ctg ttg ccg cca ctg ttc 4 32 

Val Ala Ala Met Thr Arg Tyr Arg Gin Gin Leu Leu Pro Pro Leu Phe 
130 135 140 

age gcg ctg atg aaa tat agt gac ate cat gaa tat tec tgg gca gcg 480 
Ser Ala Leu Met Lys Tyr Ser Asp He His Glu Tyr Ser Trp Ala Ala 
145 150 155 160 

cca ggc cac cag ggc ggc gtt ggt ttt acc aaa aca ccc gec gga cgt 528 
Pro Gly His Gin Gly Gly Val Gly Phe Thr Lys Thr Pro Ala Gly Arg 
165 170 175 

ttc tac cat gac tac tat ggt gaa aat ctg ttc cgc acc gac atg ggc 576 
Phe Tyr His Asp Tyr Tyr Gly Glu Asn Leu Phe Arg Thr Asp Met Gly 
180 185 190 

ate gaa cga act tec etc ggt tct ttg ctt gac cat act ggc gca ttt 624 
He Glu Arg Thr Ser Leu Gly Ser Leu Leu Asp His Thr Gly Ala Phe 
195 200 205 

ggc gaa age gaa aaa tat gec gca cgc gta ttt ggt gee gat cgc tec 672 
Gly Glu Ser Glu Lys Tyr Ala Ala Arg Val Phe Gly Ala Asp Arg Ser 
210 215 220 

tgg teg gta gtc gtc ggt act tec ggc tct aac cgc acc ate atg cag 720 
Trp Ser Val Val Val Gly Thr Ser Gly Ser Asn Arg Thr He Met Gin 
225 230 235 240 

get tgc atg acc gat aac gat gtc gtg gtc gtt gac cgt aac tgc cat 768 
Ala Cys Met Thr Asp Asn Asp Val Val Val Val Asp Arg Asn Cys His 
245 250 255 

aaa tec ate gaa caa ggt ttg atg ctg aca ggc gcg aaa ccg gtc tat 816 
Lys Ser lie Glu Gin Gly Leu Met Leu Thr Gly Ala Lys Pro Val Tyr 
260 265 270 

atg gtg cca age cgc aac cgc tac ggc att ate ggg cca ate tat ccg 864 
Met Val Pro Ser Arg Asn Arg Tyr Gly He He Gly Pro He Tyr Pro 
275 280 285 
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cag gaa atg caa cct gaa acc ttg cag aag aaa ate agt gaa age ccg 912 
Gin Glu Met Gin Pro Glu Thr Leu Gin Lys Lys lie Ser Glu Ser Pro 
290 295 300 

ctg acc aaa gac aaa gec ggg caa aaa ccg tct tac tgc gtg gtg acc 960 
Leu Thr Lys Asp Lys Ala Gly Gin Lys Pro Ser Tyr Cys Val Val Thr 
305 310 315 320 

aac tgc acc tat gac ggc gtg tgt tat aac get aaa gaa gcg cag gat 1008 
Asn Cys Thr Tyr Asp Gly Val Cys Tyr Asn Ala Lys Glu Ala Gin Asp 
325 330 335 

ctg ctg gaa aaa acc tec gat cgt ctg cac ttt gac gaa gee tgg tac 1056 
Leu Leu Glu Lys Thr Ser Asp Arg Leu His Phe Asp Glu Ala Trp Tyr 
340 345 350 

ggc tat gca cgt ttc aac ccg ate tat gec gat cac tat gec atg cgc 1104 
Gly Tyr Ala Arg Phe Asn Pro lie Tyr Ala Asp His Tyr Ala Met Arg 
355 360 " " 365 

ggc gaa cct ggc gat cac aac ggt cct acc gtt ttc gee acc cac tec 1152 
Gly Glu Pro Gly Asp His Asn Gly Pro Thr Val Phe Ala Thr His Ser 
370 375 380 

acc cac aaa ctg ctg aat gcg ctg tea cag get tct tat att cat gta 1200 
Thr His Lys Leu Leu Asn Ala Leu Ser Gin Ala Ser Tyr lie His Val 
385 " 390 395 400 

cgt gaa ggt cgt ggg gcg att aac ttc tec cgc ttc aac cag gee tac 1248 
Arg Glu Gly Arg Gly Ala lie Asn Phe Ser Arg Phe Asn Gin Ala Tyr 
405 410 415 

atg atg cat gee acc acc tec ccg ctg tat gee ate tgc gca tec aac 1296 
Met Met His Ala Thr Thr Ser Pro Leu Tyr Ala lie Cys Ala Ser Asn 
420 425 430 

gac gtg gcg gtg teg atg atg gac ggc aac age ggc ctg tea ctg aca 1344 
Asp Val Ala Val Ser Met Met Asp Gly Asn Ser Gly Leu Ser Leu Thr 
435 440 445 

cag gaa gtg att gac gaa gcg gtt gat ttc cgt cag gcg atg gcg egg 1392 
Gin Glu Val lie Asp Glu Ala Val Asp Phe Arg Gin Ala Met Ala Arg 
450 455 460 

eta tat aaa gag ttc acc get gac ggt age tgg ttc ttc aaa ccg tgg 1440 
Leu Tyr Lys Glu Phe Thr Ala Asp Gly Ser Trp Phe Phe Lys Pro Trp 
465 470 475 480 

aac aaa gaa gtc gtc acc gac cca caa acc ggc aaa acc tat gac ttt 1488 
Asn Lys Glu Val Val Thr Asp Pro Gin Thr Gly Lys Thr Tyr Asp Phe 
485 490 495 

get gac gca cca acc aaa ctg ctg acc acc gtt cag gac tgc tgg gta 1536 
Ala Asp Ala Pro Thr Lys Leu Leu Thr Thr Val Gin Asp Cys Trp Val 
500 505 510 

atg cat ccg ggc gaa age tgg cac ggc ttc aaa gat att ccg gat aac 1584 
Met His Pro Gly Glu Ser Trp His Gly Phe Lys Asp lie Pro Asp Asn 
515 520 525 
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tgg agt atg etc gac ccg att aaa gtc age ate ctt get ccg gga atg 1632 
Trp Ser Met Leu Asp Pro lie Lys Val Ser He Leu Ala Pro Gly Met 
530 535 540 

ggt gaa gat ggt gaa ctg gaa gaa acc ggt gtt ccg gcg gcg ctg gtc 1680 
Gly Glu Asp Gly Glu Leu Glu Glu Thr Gly Val Pro Ala Ala Leu Val 
545 550 555 560 

act gee tgg ctt ggt cgc cac ggc att gta cct acc cgc ?cc act gac 1728 
Thr Ala Trp Leu Gly Arg His Gly He Val Pro Thr Arg Thr Thr Asp 
565 570 575 

ttc caa att atg ttc ctg ttc tct atg ggc gta acc cgt ggg aaa tgg 1776 
Phe Gin He Met Phe Leu Phe Ser Met Gly Val Thr Arg Gly Lys Trp 
580 585 590 

gga act ctg gtt aac acc ctt tgc tec ttc aaa cgc cac tat gac gec 1824 
Gly Thr Leu Val Asn Thr Leu Cys Ser Phe Lys Arg His Tyr Asp Ala 
595 600 605 

aac aca ccg ctg gcg cag gtg atg ccg gaa ctt gtt gaa caa tat cct 1872 
Asn Thr Pro Leu Ala Gin Val Met Pro Glu Leu Val Glu Gin Tyr Pro 
610 615 620 

gac act tac gcg aac atg ggg att cac gat ctg ggt gac acc atg ttt 1920 
Asp Thr Tyr Ala Asn Met Gly He His Asp Leu Gly Asp Thr Met Phe 
625 630 635 640 

gee tgg ctg aaa gaa aac aac cct ggc gca egg ttg aac gaa gee tat 1968 
Ala Trp Leu Lys Glu Asn Asn Pro Gly Ala Arg Leu Asn Glu Ala Tyr 
645 650 655 

tec ggc ctg ccg gtg gcg gaa gtc acc ccg cgt gaa gcg tac aac gcg 2016 
Ser Gly Leu Pro Val Ala Glu Val Thr Pro Arg Glu Ala Tyr Asn Ala 
660 665 670 

att gtc gac aac aat gtc gaa ctg gta tec att gaa aat ctg cca gga 2064 
He Val Asp Asn Asn Val Glu Leu Val Ser He Glu Asn Leu Pro Gly 
675 680 685 

cgc ate gcg gca aac tea gtt ate ccg tat ccg cca gga ate ccg atg 2112 
Arg He Ala Ala Asn Ser Val He Pro, Tyr Pro Pro Gly He Pro Met 
690 695 700 

ctg ctg tct ggt gaa aac ttc ggc gat aaa aac agt ccg caa gta agt 2160 
Leu Leu Ser Gly Glu Asn Phe Gly Asp Lys Asn Ser Pro Gin Val Ser 
705 710 715 720 

tat tta cgc teg ctg caa tec tgg gac cac cat ttc cct gga ttt gaa 2208 
Tyr Leu Arg Ser Leu Gin Ser Trp Asp His His Phe Pro Gly Phe Glu 
725 730 735 

cac gaa act gaa ggg act gaa att att gac ggt att tac cac gtt atg 2256 
His Glu Thr Glu Gly Thr Glu He He Asp Gly He Tyr His Val Met 
740 745 750 

tgc gtg aaa gcg taa 2271 
Cys Val Lys Ala * 
755 
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<210> 256 
<211> 762 
<212> DNA 

<213> Escherichia coli 

<220> 

<221> CDS 

<222> (1) . . . (762) 

<400> 256 

atg agg att tgc age gac caa cct tgt att gtt tta ttg act gaa aaa 4 8 

Met Arg lie Cys Ser Asp Gin Pro Cys lie Val Leu Leu Thr Glu Lys 
1 5 10 15 

gat gtc tgg ata agg gtg aat ggg aaa gaa cct att age ctt aaa get 96 
Asp Val Trp He Arg Val Asn Gly Lys Glu Pro He Ser Leu Lys Ala 
20 25 30 

aac cat atg gcg tta tta aat tgt gaa aat aat att ate gac gtc tec 144 
Asn His Met Ala Leu Leu Asn Cys Glu Asn Asn He lie Asp Val Ser 
35 40 45 

tct ctt aac aac act ttg gtt get cat att agt cac gac ate ate aaa 192 
Ser Leu Asn Asn Thr Leu Val Ala His He Ser His Asp He He Lys 
50 55 60 

gat tac etc egg ttt ctg aat aaa gat etc teg caa ata cca gta tgg 240 
Asp Tyr Leu Arg Phe Leu Asn Lys Asp Leu Ser Gin lie Pro Val Trp 
65 70 75 80 

caa cgt age get acg ccc ata etc acc ctg cca tgc ctg acg cca gac 288 
Gin Arg Ser Ala Thr Pro He' Leu Thr Leu Pro Cys Leu Thr Pro Asp 
85 90 95 

gtc ttt cgc gtt gec gcg caa cac age atg atg ccc gca gaa act gag 336 
Val Phe Arg Val Ala Ala Gin His Ser Met Met Pro Ala Glu Thr Glu 
100 105 110 

tea gaa aag gaa cga aca cgc gca tta tta ttc acg gtg eta tec cgt 384 
Ser Glu Lys Glu Arg Thr Arg Ala Leu Leu Phe Thr Val Leu Ser Arg 
115 120 125 

ttt etc gac agt aaa aaa ttc gtt tea tta atg atg tat atg tta cgt 432 
Phe Leu Asp Ser Lys Lys Phe Val Ser Leu Met Met Tyr Met Leu Arg 
130 135 140 

aat tgt gta agt gac age gtt tat caa att att gaa age gat att cac 480 
Asn Cys Val Ser Asp Ser Val Tyr Gin He He Glu Ser Asp He His 
145 150 155 160 

aaa gac tgg aat ctt agt atg gta gec agt tgt tta Lgt ctt age cca 528 
Lys Asp Trp Asn Leu Ser Met Val Ala Ser Cys Leu Cys Leu Ser Pro 
165 170 175 

agt ctg tta aag aaa aag ttg aaa age gaa aac acc agt tat age caa 576 
Ser Leu Leu Lys Lys Lys Leu Lys Ser Glu Asn Thr Ser Tyr Ser Gin 
180 185 190 

ata ate acc acc tgc cgc atg cgt tat gec gta aat gaa tta atg atg 624 
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lie lie Thr Thr Cys Arg Met Arg Tyr Ala Val Asn Glu Leu Met Met 
195 " 200 205 

gac ggt aaa aat ate tct cag gta tea cag tec tgc ggc tac aac agt 672 
Asp Gly Lys Asn He Ser Gin Val Ser Gin Ser Cys Gly Tyr Asn Ser 
210 "* 215 220 

acg teg tac ttt att tct gtc ttt aaa gac ttc tac ggt atg acg ccg 720 
Thr Ser Tyr Phe He Ser Val Phe Lys Asp Phe Tyr Gly Met Thr Pro 
225 "* 230 235 240 

ctg cat tat gtt agt cag cac aga gaa cgc act gtc gec tga 762 
Leu His Tyr Val Ser Gin His Arg Glu Arg Thr Val Ala * 
245 250 



<210> 257 
<211> 1278 
<212> DNA 

<213> Escherichia coli 

<220> 

<221> CDS 

<222> (1)...(1278) 

<400> 257 

ttg ttg agg etc cct aat att tac ttt aag ggc tat att aga ata aca 48 
Met Leu Arg Leu Pro Asn He Tyr Phe Lys Gly Tyr He Arg He Thr 
15 10 15 

cag gaa aca aat atg gca aca gca tgg tat aaa caa gtt aat cca cca 96 
Gin Glu Thr Asn Met Ala Thr Ala Trp Tyr Lys Gin Val Asn Pro Pro 
20 25 30 

caa egg aaa get ctt ttt tec gca tgg ctt gga tat gta ttt gat ggc 144 
Gin Arg Lys Ala Leu Phe Ser Ala Trp Leu Gly Tyr Val Phe Asp Gly 
35 40 45 

ttt gat ttt atg atg ata ttt tac att ctt cat att ata aaa gca gat 192 
Phe Asp Phe Met Met He Phe Tyr He Leu His He He Lys Ala Asp 
50 55 60 

ctt ggc att acg gat att cag get act tta ata ggg aca gtg gec ttc 240 
Leu Gly He Thr Asp He Gin Ala Thr Leu He Gly Thr Val Ala Phe 
65 "* 70 75 80 

ata gee aga cct att gga ggt ggt ttt ttt ggt gee atg get gat aaa 288 
He Ala Arg Pro He Gly Gly Gly Phe Phe Gly Ala Met Ala Asp Lys 
85 90 95 

tat ggt cgt aag cca atg atg atg tgg gca att ttc att tac tea gtc 336 
Tyr Gly Arg Lys Pro Met Met Met Trp Ala He Phe He Tyr Ser Val 
100 105 110 



gga aca ggc ctt age ggt att get aca aac tta tat atg etc gca gtt 
Gly Thr Gly Leu Ser Gly He Ala Thr Asn Leu Tyr Met Leu Ala Val 
115 120 . 125 



384 



tgc cgt ttt att gtt ggc tta ggg atg tct ggt gaa tat gca tgt get 432 
Cys Arg Phe lie Val Gly Leu Gly Met Ser Gly Glu Tyr Ala Cys Ala 



-217- 



WO 01/34810 



PCT/USOO/30950 



130 135 140 

tea act tat gcg gta gaa agt tgg cct aaa aat ctt caa tct aaa get 480 

Ser Thr Tyr Ala Val Glu Ser Trp Pro Lys Asn Leu Gin Ser Lys Ala 

145 150 155 160 

agt get ttt ttg gta agt ggt ttt tct gtt gga aat att att gcg gca 528 

Ser Ala Phe Leu Val Ser Gly Phe Ser Val Gly Asn He He Ala Ala 

165 170 175 

caa ata ate cct cag ttt get gaa gta tat gga tgg aga aac tct ttt 57 6 

Gin He lie Pro Gin Phe Ala Glu Val Tyr Gly Trp Arg Asn Ser Phe 

180 185 190 

ttt ata ggc ctg tta cca gtt tta eta gtt ctt tgg ate aga aaa agt 624 

Phe He Gly Leu Leu Pro Val Leu Leu Val Leu Trp He Arg Lys Ser 

195 200 205 

get cca gaa agt cag gag tgg att gaa gat aaa tat aag gat aaa tea 672 

Ala Pro Glu Ser Gin Glu Trp He Glu Asp Lys Tyr Lys Asp Lys Ser 

210 215 220 

aca ttt ttg tct gtc ttc aga aaa cca cat ctt tea ate tct atg ate 720 

Thr Phe Leu Ser Val Phe Arg Lys Pro His Leu Ser He Ser Met He 

225 230 235 240 

gtt ttc etc gtc tgt ttt tgt eta ttt ggt gca aac tgg ccg ata aac 768 

Val Phe Leu Val Cys Phe Cys Leu Phe Gly Ala Asn Trp Pro He Asn 

245 250 255 

gga eta ctt cct tec tac ctg gca gat aat gga gtt aat aca gtg gtc 816 

Gly Leu Leu Pro Ser Tyr Leu Ala Asp Asn Gly Val Asn Thr Val Val 

260 265 270 

att tea act ctg atg aca ata gca ggt tta gga aca ctg aca ggt aca 864 

He Ser Thr Leu Met Thr He Ala Gly Leu Gly Thr Leu Thr Gly Thr 

275 280 285 

ata ttt ttt ggt ttt gtt ggt gat aag att ggt gta aaa aaa gee ttt 912 

He Phe Phe Gly Phe Val Gly Asp Lys He Gly Val Lys Lys Ala Phe 

290 295 300 

gta gtc ggt eta ata act tea ttt att ttc ctt tgt cct ctt ttt ttt 960 

Val Val Gly Leu He Thr Ser Phe He Phe Leu Cys Pro Leu Phe Phe 

305 310 315 320 

att tct gtg aaa aac tct tct ctt ata gga tta tgt etc ttt gga tta 1008 

He Ser Val Lys Asn Ser Ser Leu He Gly Leu Cys Leu Phe Gly Leu 

325 330 335 

atg ttt aca aat tta ggt att gca ggg ttg gtt cca aaa ttt ata tat 1056 

Met Phe Thr Asn Leu Gly He Ala Gly Leu Val Pro Lys Phe He Tyr 

340 345 350 

gat tac ttt cca aca aaa tta aga gga tta ggg ace ggt ctt att tat 1104 

Asp Tyr Phe Pro Thr Lys Leu Arg Gly Leu Gly Thr Gly Leu He Tyr 

355 360 365 

aac tta ggg gca act gga gga atg gee gca cct gta tta get aca tac 1152 

Asn Leu Gly Ala Thr Gly Gly Met Ala Ala Pro Val Leu Ala Thr Tyr 
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370 375 380 

att tea gga tat tat ggc tta ggt gtt tea tta ttc att gtt acg gtt 1200 
He Ser Gly Tyr Tyr Gly Leu Gly Val Ser Leu Phe He Val Thr Val 
385 390 395 400 

gca ttc tct gec tta tta att ttg tta gtt ggt ttt gat att cca ggt 1248 
Ala Phe Ser Ala Leu Leu lie Leu Leu Val Gly Phe Asp lie Pro Gly 
405 410 415 

aaa att tat aaa eta tec gtg get aaa tga 1278 
Lys He Tyr Lys Leu Ser Val Ala Lys * 
420 425 



<210> 258 
<211> 1134 
<212> DNA 

<213> Escherichia coli 

<220> 

<221> CDS 

<222> (1) . . . (1134) 

<400> 258 

atg att gga ggc ttt atg att aat tat ggc gtt gtt ggt gtt gga tac 48 
Met He Gly Gly Phe Met He Asn Tyr Gly Val Val Gly Val Gly Tyr 
15 10 15 

ttt ggc get gaa tta get cgt ttt atg aat atg cat gat aat gca aaa 96 
Phe Gly Ala Glu Leu Ala Arg Phe Met Asn Met His Asp Asn Ala Lys 
20 25 30 

att aca tgt gta tac gat cct gaa aat gga gaa aat att gec cgt gaa 144 
He Thr Cys Val Tyr Asp Pro Glu Asn Gly Glu Asn He Ala Arg Glu 
35 40 45 

ttg cag tgt ate aat atg tea age ttg gat get tta gtc tea agt aaa 192 
Leu Gin Cys lie Asn Met Ser Ser Leu Asp Ala Leu Val Ser Ser Lys 
50 55 60 

tta gtc gat tgc gtg ate gta gee ace cca aat tat ctg cat aaa gaa 240 
Leu Val Asp Cys Val He Val Ala Thr Pro Asn Tyr Leu His Lys Glu 
65 70 75 80 

cca gta att aaa gca gca aag aat aag aag cat gtt ttt tgt gaa aaa 288 
Pro Val He Lys Ala Ala Lys Asn Lys Lys His Val Phe Cys Glu Lys 
85 90 95 

cca att gca tta agt tat gaa gat tgt gtg gat atg gtc aaa gcg tgt 336 
Pro He Ala Leu Ser Tyr Glu Asp Cys Val Asp Met Val Lys Ala Cys 
100 105 110 

aaa gaa get ggt gtg acc ttt atg gec ggg cat att atg aat ttt ttc 384 
Lys Glu Ala Gly Val Thr Phe .Met Ala Gly His He Met Asn Phe Phe 
115 120 125 

aat ggg gtt caa tat gca egg aag tta att aaa gaa ggt gtt ate ggc 432 
Asn Gly Val Gin Tyr Ala Arg Lys Leu He Lys Glu Gly Val He Gly 
130 135 140 
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gaa ata tta tea tgt cat act aag aga aat ggc tgg gaa aac aaa caa 480 

Glu lie Leu Ser Cys His Thr Lys Arg Asn Gly Trp Glu Asn Lys Gin 

145 150 155 160 

gag aga ctt tec tgg aaa aag atg aaa gaa caa tct ggt gga cat eta 528 

Glu Arg Leu Ser Trp Lys Lys Met Lys Glu Gin Ser Gly Gly His Leu 
165 170 175 

tat cat cat ata cat gag tta gat tgt gtt cag cat tta ctt gga gaa 576 

Tyr His His lie His Glu Leu Asp Cys Val Gin His Leu Leu Gly Glu 

180 185 190 

ata cca gag acg gtt act atg att ggt gga aat ttg gec cat tct ggt 624 

He Pro Glu Thr Val Thr Met He Gly Gly Asn Leu Ala His Ser Gly 
195 200 205 

cca gga ttt ggc aat gaa gat gat atg tta ttt atg acc ttg gaa ttc 672 

Pro Gly Phe Gly Asn Glu Asp Asp Met Leu Phe Met Thr Leu Glu Phe 
210 215 220 

ccg tea gga aaa eta gca acc tta gag tgg ggg agt gca ttt aac tgg 720 

Pro Ser Gly Lys Leu Ala Thr Leu Glu Trp Gly Ser Ala Phe Asn Trp 

225 ' 230 235 240 

ccg gaa cat tat gtc ate ate aat gga act aaa ggc tct att aaa att 768 

Pro Glu His Tyr Val He He Asn Gly Thr Lys Gly Ser He Lys lie 
245 250 255 

gat atg caa gaa aca gca ggg tea ctt agg att ggc ggt cag aca aag 816 

Asp Met Gin Glu Thr Ala Gly Ser Leu Arg He Gly Gly Gin Thr Lys 

260 265 270 

cat ttt ttg gtc cat gaa aca caa gaa gaa gat gat gat cgt egg aaa 864 

His Phe Leu Val His Glu Thr Gin Glu Glu Asp Asp Asp Arg Arg Lys 
275 280 285 

ggc aat atg acc tea gaa atg gat ggc get ata gca tat ggt cat cca 912 

Gly Asn Met Thr Ser Glu Met Asp Gly Ala He Ala Tyr Gly His Pro 
290 295 300 

gga aaa aaa aca cca tta tgg ctt gec agt tta ata aga aag gag acg 960 

Gly Lys Lys Thr Pro Leu Trp Leu Ala Ser Leu He Arg Lys Glu Thr 

305 310 315 320 

tta ttc etc cat aat ate etc tgt ggt gca aaa cct gaa gaa gat tat 1008 

Leu Phe Leu His Asn He Leu Cys Gly Ala Lys Pro Glu Glu Asp Tyr 
325 330 335 

att gac ctt etc aat ggt gag gcg gec atg teg gcg att get act get 1056 

He Asp Leu Leu Asn Gly Glu Ala Ala Met Ser Ala He Ala Thr Ala 

340 345 350 

gat get gec act ctt tea aga teg cag gac agg aaa gtg aaa ate agt 1104 

Asp Ala Ala Thr Leu Ser Arg Ser Gin Asp Arg Lys Val Lys He Ser 
355 360 365 

gag ate att aaa cat aca tea gta atg taa 1134 

Glu He He Lys His Thr Ser Val Met * 
370 375 
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<210> 259 
<211> 1395 
<212> DNA 

<213> Escherichia coli 

<220> 

<221> CDS 

<222> (1) . . . (1395) 

<400> 259 

atg agt gcg ggg aaa ttg ccg gag ggg tgg gtt ate gec cca gta tct 48 
Met Ser Ala Gly Lys Leu Pro Glu Gly Trp Val lie Ala Pro Val Ser 
15 10 15 

acg gtc aca act eta ate cga gga gta acg tat aaa aaa gag cag gca 96 
Thr Val Thr Thr Leu He Arg Gly Val Thr Tyr Lys Lys Glu Gin Ala . 
20 25 30 

ata aat tat eta aaa gat gat tat ttg cct ctt ate cgt gcg aac aat 144 
He Asn Tyr Leu Lys Asp . Asp Tyr Leu Pro Leu He Arg Ala Asn Asn 
35 40 45 

att cag aat ggc aag ttt gat act acg gac ttg gtt ttt gtt cct aaa 192 
He Gin Asn Gly Lys Phe Asp Thr Thr Asp Leu Val Phe Val Pro Lys 
50 55 60 

aat ctt gtt aaa gaa agt caa aaa ata tct cct gaa gat att gtt att 240 
Asn Leu Val Lys Glu Ser Gin Lys He Ser Pro Glu Asp He Val lie 
65 70 , 75 80 

gca atg tea tea ggg age aaa tec gta gtt ggt aaa tec gca cat cag 288 
Ala Met Ser Ser Gly Ser Lys Ser Val Val Gly Lys Ser Ala His Gin 
85 90 95 

cat eta cca ttt gaa tgt agt ttc ggc gca ttt tgc ggt gta tta cgt 336 
His Leu Pro Phe Glu Cys Ser Phe Gly Ala Phe Cys Gly Val Leu Arg 
100 105 110 

cct gaa aaa ctt ata ttt tct ggt ttt att get cat ttc aca aaa tct 384 
Pro Glu Lys Leu He Phe Ser Gly Phe He Ala His Phe Thr. Lys Ser 
115 120 125 

tct ctt tat cga aac aaa att tea tea ctt tct get ggt gca aat att 4 32 

Ser Leu Tyr Arg Asn Lys He Ser Ser Leu Ser Ala Gly Ala Asn He 
130 " 135 140 

aat aat att aag ccg gca age ttt gat ttg ata aat ata cca ate cca 4 80 

Asn Asn He Lys Pro Ala Ser Phe Asp Leu He Asn He Pro He Pro 
145 150 155 160 

cca ctt gee gaa caa aaa ate ate get gaa aaa etc gat acg ctg ctg 528 
Pro Leu Ala Glu Gin Lys He He Ala Glu Lys Leu Asp Thr Leu Leu 
165 170 .175 

gcg cag gta gac age ace aaa gca cgt ttt gag caa ate cca caa ate 576 
Ala Gin Val Asp Ser Thr Lys Ala Arg Phe Glu Gin He Pro Gin He 
180 185 190 
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ctg aaa cgt ttt cgt caa gcg gta ttg ggg ggc gca gtt aat gga aaa 624 

Leu Lys Arg Phe Arg Gin Ala Val Leu Gly Gly Ala Val Asn Gly Lys 
195 200 205 

ttg aca gaa aaa tgg cgt aat ttt gag ccg caa cat tct gta ttt aag 672 

Leu Thr Glu Lys Trp Arg Asn Phe Glu Pro Gin His Ser Val Phe Lys 

210 215 220 

aag tta aat ttt gaa tct ate tta act gaa tta cgt aat ggt ctt tea 720 

Lys Leu Asn Phe Glu Ser lie Leu Thr Glu Leu Arg Asn Gly Leu Ser 

225 230 235 240 

tea aag cca aat gaa agt ggt gtt ggt cat cca ata eta cgc att agt 768 

Ser Lys Pro Asn Glu Ser Gly Val Gly His Pro lie Leu Arg lie Ser 
245 250 255 

tct gta cgt get ggc cat gta gat caa aac gat att egg ttt eta gaa 816 

Ser Val Arg Ala Gly His Val Asp Gin Asn Asp lie Arg Phe Leu Glu 
260 * 265 270 

tgt tea gaa agt gaa eta aac cgc cac aaa tta caa gat gga gat ctt 864 

Cys Ser Glu Ser Glu Leu Asn Arg His Lys Leu Gin Asp Gly Asp Leu 
275 280 285 

tta ttt act cgc tat aac gga agt tta gaa ttt gtt ggt gtt tgt ggg 912 

Leu Phe Thr Arg Tyr Asn Gly Ser Leu Glu Phe Val Gly Val Cys Gly 

290 295 300 

tta ttg aaa aaa tta caa cat caa aat ttg eta tat cct gat aaa ctt 960 

Leu Leu Lys Lys Leu Gin His Gin Asn Leu Leu Tyr Pro Asp Lys Leu 

305 ^ 310 315 320 

att cga get cga tta ace aaa gat get tta cca gaa tat ate gaa ata 1008 

lie Arg Ala Arg Leu Thr Lys Asp Ala Leu Pro Glu Tyr lie Glu lie 
325 330 335 

ttt ttt tea tec ccc tea gca cga aat gca atg atg aac tgc gtg aaa 1056 

Phe Phe Ser Ser Pro Ser Ala Arg Asn Ala Met Met Asn Cys Val Lys 
340 345 350 

aca act tct ggt caa aaa ggt att tea gga aaa gat ate aaa tec caa 1104 

Thr Thr Ser Gly Gin Lys Gly He Ser Gly Lys Asp He Lys Ser Gin 
355 " 360 365 

gtt gtt tta tta cct cca gta aaa gaa caa gee gaa ate gtt cgc cgc 1152 

Val Val Leu Leu Pro Pro Val Lys Glu Gin Ala Glu He Val Arg Arg 

370 375 380 

gtc gag caa etc ttc gec tac gec gac acc ata gaa aaa cag gtc aac 1200 

Val Glu Gin Leu Phe Ala Tyr Ala Asp Thr lie Glu Lys Gin Val Asn 

385 390 395 400 

aac gee tta gec cgc gtc aac aac ctg acg caa tec ate ctg gca aaa 1248 

Asn Ala Leu Ala Arg Val Asn Asn Leu Thr Gin Ser He Leu Ala Lys 
405 410 415 

gcg ttc cgt ggt gaa ctt acc gec cag tgg egg gec gaa aac ccg gat 1296 

Ala Phe Arg Gly Glu Leu Thr Ala Gin Trp Arg Ala Glu Asn Pro Asp 
420 425 430 
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ttg ate age gga gaa aac 
Leu lie Ser Gly Glu Asn 
435 

get gaa cgc gca get age 
Ala Glu Arg Ala Ala Ser 
450 

tga 



age gee gee gcg ttg ctg 
Ser Ala Ala Ala Leu Leu 
440 

ggg ggt aaa aaa gee tea 
Gly Gly Lys Lys Ala Ser 
455 460 



gaa aaa ate aaa 1344 

Glu Lys lie Lys 

445 

cgt aaa aaa tec 1392 
Arg Lys Lys Ser 



1395 



<210> 260 
<211> 1590 
<212> DNA 

<213> Escherichia coli 

<220> 

<221> CDS 

<222> (1) . . . (1590) 

<400> 260 

atg aac aat aac gat ctg gtc gcg aag ctg tgg aag ctg tgc gac aac 48 
Met Asn Asn Asn Asp Leu Val Ala Lys Leu Trp Lys Leu Cys Asp Asn 
15 10 15 

ctg cgc gat ggc ggc gtt tec tat caa aac tac gtc aat gaa etc gee 96 
Leu Arg Asp Gly Gly Val Ser Tyr Gin Asn Tyr Val Asn Glu Leu Ala 
20 25 30 

teg ctg ctg ttt ttg aaa atg tgt aaa gag ace ggt cag gaa gcg gaa 144 
Ser Leu Leu Phe Leu Lys Met Cys Lys Glu Thr Gly Gin Glu Ala Glu 
35 40 45 

tac ctg ccg gaa ggt tac cgc tgg gat gac ctg aaa tec cgc ate ggc 192 
Tyr Leu Pro Glu Gly Tyr Arg Trp Asp Asp Leu Lys Ser Arg lie Gly 
50 55 60 

cag gag cag ttg cag ttc tac cga aaa atg etc gtg cat tta ggc gaa 240 
Gin Glu Gin Leu Gin Phe Tyr Arg Lys Met Leu Val His Leu Gly Glu 
65 70 75 80 

gat gac aaa aag ctg gta cag gca gtt ttt cat aat gtt agt ace acc 288 
Asp Asp Lys Lys Leu Val Gin Ala Val Phe His Asn Val Ser Thr Thr 
85 90 95 

ate acc gag ccg aaa caa ata acc gca ctg gtc age aat atg gat teg 336 
lie Thr Glu Pro Lys Gin lie Thr Ala Leu Val Ser Asn Met Asp Ser 
100 105 110 

ctg gac tgg tac aac ggc gcg cac ggt aag teg cgc gat gac ttc ggc 384 
Leu Asp Trp Tyr Asn Gly Ala His Gly Lys Ser Arg Asp Asp Phe Gly 
115 120 125 

gat atg tac gaa ggg ctg ttg cag aag aac gcg aat gaa acc aag tct 432 
Asp Met Tyr Glu Gly Leu Leu Gin Lys Asn Ala Asn Glu Thr Lys Ser 
130 135 140 

ggt gca ggc cag tac ttc acc ccg cgt ccg ctg att aaa acc att att 480 



-223- 



WO 01/34810 



PCT/US00/30950 



Gly Ala Gly Gin Tyr Phe Thr Pro Arg Pro Leu lie Lys Thr lie He 
145 " 150 155 • 160 

cat ctg ctg aaa ccg cag ccg cgt gaa gtg gtg cag gac ccg gcg gca 528 
His Leu Leu Lys Pro Gin Pro Arg Glu Val Val Gin Asp Pro Ala Ala 
165 170 175 

ggt acg gcg ggc ttt ttg att gaa gcc gac cgc tat gtt aag teg caa 576 
Gly Thr Ala Gly Phe Leu He Glu Ala Asp Arg Tyr Val Lys Ser Gin 
180 185 190 

acc aat gat ctg gac gac ctt gat ggc gac acg cag gat ttc cag ate 624 
Thr Asn Asp Leu Asp Asp Leu Asp Gly Asp Thr Gin Asp Phe Gin He 
195 ** 200 205 

cac cgc gcg ttt ate ggc etc gaa ctg gtg ccc ggc acc cgt cgt ctg 672 
His Arg Ala Phe He Gly Leu Glu Leu Val Pro Gly Thr Arg Arg Leu 
210 215 220 

gca ctg atg aac tgc ctg ctg cac gaL att gaa ggc aac etc gac cac 720 
Ala Leu Met Asn Cys Leu Leu His Asp lie Glu Gly Asn Leu Asp His 
225 230 235 240 

ggc ggc gca ate cgt ctg ggc aac act ctg ggt age gac ggt gaa aac 7 68 

Gly Gly Ala He Arg Leu Gly Asn Thr Leu Gly Ser Asp Gly Glu Asn 
245 250 255 

ctg ccg aag gcg cat att gtc gcc act aac ccg ccg ttt ggc age gcc 816 
Leu Pro Lys Ala His He Val Ala Thr Asn Pro Pro Phe Gly Ser Ala 
260 265 270 

gca ggc acc aac att acc cgc acc ttt gtt cac ccg acc age aac aaa 864 
Ala Gly Thr Asn He Thr Arg Thr Phe Val His Pro Thr Ser Asn Lys 
275 280 285 

cag ttg tgc ttt atg cag cat att ate gaa acg ctg cat ccc ggc ggt 912 
Gin Leu Cys Phe Met Gin His He He Glu Thr Leu His Pro Gly Gly 
290 295 300 

cgt gcg gcg gtg gtg gtg ccg gat aac gtg ctg ttt gaa ggc ggc aaa 960 
Arg Ala Ala Val Val Val Pro Asp Asn Val Leu Phe Glu Gly Gly Lys 
305 310 315 320 

ggc acc gac att cgt cgt gac ctg atg gat aag tgt cat ctg cac acc 1008 
Gly Thr Asp He Arg Arg Asp Leu Met Asp Lys Cys His Leu His Thr 
325 330 335 

att ctg cgt ctg ccg acc ggt att ttt tac get cag ggc gtg aag acc 1056 
He Leu Arg Leu Pro Thr Gly He Phe Tyr Ala Gin Gly Val Lys Thr 
340 345 350 

aac gtg ctg ttc ttt acc aaa ggg acg gtg gcg aac ccg aat cag gat 1104 
Asn Val Leu Phe Phe Thr Lys Gly Thr Val Ala Asn Pro Asn Gin Asp 
355 360 365 

aag aac tgt acc gat gat gtg tgg gtg tat gac ctg cgt acc aat atg 1152 
Lys Asn Cys Thr Asp Asp Val Trp Val Tyr Asp Leu Arg Thr Asn Met 
370 375 380 

ccg agt ttc ggc aag cgc aca ccg ttt acc gac gag cat ttg cag ccg 1200 
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Pro Ser Phe Gly Lys Arg Thr Pro Phe Thr Asp Glu His Leu Gin Pro 
385 390 395 400 

ttt gag cgc gtg tat ggc gaa gac ccg cac ggt tta age ccg cgc act 1248 
Phe Glu Arg Val Tyr Gly Glu Asp Pro His Gly Leu Ser Pro Arg Thr 
405 410 415 

gaa ggt gaa tgg agt ttt aac gec gaa gag acg gaa gtt gec gac age 1296 
Glu Gly Glu Trp Ser Phe Asn Ala Glu Glu Thr Glu Val Ala Asp Ser 
420 425 430 

gaa gag aac aaa aac acc gac cag cat ctt get ace age cgc tgg cgc 1344 
Glu Glu Asn Lys Asn Thr Asp Gin His Leu Ala Thr Ser Arg Trp Arg 
435 440 445 

aag ttc age cgt gag tgg ate cgc acc gca aaa tec gat teg ctg gat 1392 
Lys Phe Ser Arg Glu Trp lie Arg Thr Ala Lys Ser Asp Ser Leu Asp 
450 455 460 

ate tec tgg ctg aaa gat aaa gac agt att gat gee gac age ctg ccg 1440 
He Ser Trp Leu Lys Asp Lys Asp Ser He Asp Ala Asp Ser Leu Pro 
465 470 475 480 

gag ccg gat gta tta gcg gca gaa gcg atg ggc gaa ctg gta cag gcg 1488 
Glu Pro Asp Val Leu Ala Ala Glu Ala Met Gly Glu Leu Val Gin Ala 
485 490 495 

ctg tct gaa ctg gat gcg ctg atg cgt gaa ctg ggg gcg age gat gag 1536 
Leu Ser Glu Leu Asp Ala Leu Met Arg Glu Leu Gly Ala Ser Asp Glu 
500 505 510 

gee gat ttg cag cgt cag ttg ctg gaa gaa gcg ttt ggt ggg gtg aag 1584 
Ala Asp Leu Gin Arg Gin Leu Leu Glu Glu Ala Phe Gly Gly Val Lys 
515 520 525 

gaa tga 1590 
Glu * 



<210> 261 
<211> 297 
<212> DNA 

<213> Escherichia coli 

<220> 

<221> CDS 

<222> (1) . . . (297) 

<400> 261 

atg aaa aaa gag aac tat tea ttc aag caa get tgt get gtt gtc ggt 48 

Met Lys Lys Glu Asn Tyr Ser Phe Lys Gin Ala Cys Ala Val Val Gly 
1 5 10 15 

ggg caa tea gca atg get agg ctt tta ggt gta tea cct cca age gta 96 
Gly Gin Ser Ala Met Ala Arg Leu Leu Gly Val Ser Pro Pro Ser Val 
20 25 30 

aat caa tgg ate aaa ggg gta cgt caa ttg cct gee gag aga tgt cca 144 
Asn Gin Trp He Lys Gly Val Arg Gin Leu Pro Ala Glu Arg Cys Pro 
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35 40 45 

gca att gaa cgt gca aca aga ggt gag gtt ctg tgc gaa gaa ctt cgt 192 
Ala lie Glu Arg Ala Thr Arg Gly Glu Val Leu Cys Glu Glu Leu Arg 
50 55 60 

cct gat att gac tgg tea tat tta cga cgt teg gca tgt tgt teg cag 240 
Pro Asp lie Asp Trp Ser Tyr Leu Arg Arg Ser Ala Cys Cys Ser Gin 
65 70 75 80 

aat atg tea gtg aag caa eta aat gac agt aac aaa tec tea ttt gat 288 
Asn Met Ser Val Lys Gin Leu Asn Asp Ser Asn Lys Ser Ser Phe Asp 
85 90 95 

cat ace tga 297 
His Thr * 



<210> 262 
<211> 423 
<212> DNA 

<213> Escherichia coli 






















<220> 
<221> CDS 
<222> (1) . . . 


(423) 
























<400> 262 
atg aaa ate 
Met Lys He 
1 


aag 
Lys 


cat 
His 
5 


gag 
Glu 


cac 
His 


ate 
He 


gaa 
Glu 


tea 
Ser 
10 


gtg ttg 
Val Leu 


ttt 
Phe 


gee 
Ala 


eta 
Leu 
15 


gca 
Ala 


gec gaa 
Ala Glu 


aaa 
Lys 


ggg 

Gly 
20 


cag 
Gin 


gca 
Ala 


tgg 
Trp 


gta 
Val 


gee 
Ala 
25 


aat 
Asn 


gca 
Ala 


att 
He 


act 
Thr 


gaa 
Glu 
30 


gaa 
Glu 


tat 
1'yr 


ctg cgc 
Leu Arg 


cag 
Gin 
35 


ggg 

Gly 


ggc 

Gly 


ggc 
Gly 


gaa 
Glu 


ttg 
Leu 
40 


ccc 
Pro 


ctg gtt 
Leu Val 


cca ggc aag 
Pro Gly Lys 
45 


gac tgg 
Asp Trp 


aac aat 
Asn Asn 
50 


cag 
Gin 


cag 
Gin 


aat 
Asn 


ate 
He 


tat 
Tyr 
55 


cac 
His 


cgt 
Arg 


tgg 
Trp 


ttg 
Leu 


aaa 
Lys 
60 


ggt gaa 
Gly Glu 


acg 
Thr 


aaa 
Lys 


acg caa 
Thr Gin 
65 


aga 
Arg 


gaa 
Glu 


aaa 
Lys 


att 
He 
70 


cag 
Gin 


aag 
Lys 


ctg 
Leu 


ate 
He 


cca 
Pro 
75 


gca 
Ala 


att 
He 


ctg 
Leu 


gca 
Ala 


ate 
He 
80 


ctt ccg 
Leu Pro 


cgc 
Arg 


gag 
Glu 


ctg 
Leu 
85 


cgt 
Arg 


cac cga etc tgc 
His Arg Leu Cys 
90 


ate 
He 


ttc gat ace 
Phe Asp Thr 


ctg 
Leu 
95 


gaa 
Glu 


cgc cgt 
Arg Arg 


gca 
Ala 


tta 
Leu 
100 


ctg 
Leu 


gcg 
Ala 


gcg 
Ala 


cag 
Gin 


gaa 
Glu 
105 


gcg 

Ala 


tta 
Leu 


agt 
Ser 


acg 
Thr 


gca 
Ala 
110 


att gat 
He Asp 


gcg cat 
Ala His 


gat 
Asp 
115 


gat 
Asp 


gca 
Ala 


gtc 
Val 


caa 
Gin 


gee 
Ala 
120 


gtt 
Val 


tac egg 
Tyr Arg 


aaa 
Lys 


gcg 
Ala 
125 


cat 
His 


ttc 
Phe 


age 
Ser 



48 



96 



144 



192 



240 



288 



336 



384 
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ggc ggc ggt tct tec gac gat tct gtc att gtt cat taa 
Gly Gly Gly Ser Ser Asp Asp Ser Val He Val His * 
130 135 140 



423 



<210> 263 
<211> 858 
<212> DNA 

<213> Escherichia coli* 

<220> 

<221> CDS 

<222> (1) • . . (858) 

<400> 263 

atg ctg ttt gtg ctt att eta age cac egg gca gca tea tac ggg gca 48 
Met Leu Phe Val Leu He Leu Ser His Arg Ala Ala Ser Tyr Gly Ala 
15 10 15 

att atg gee gca tta cca tac atg caa ctg tac ata get gat tac ctg 96 
He Met Ala Ala Leu Pro Tyr Met Gin Leu Tyr He Ala Asp Tyr Leu 
20 " 25 30 

get gac acc atg cat ttg tea gca gag gag cat ggt gcg tat ttg ttg 144 
Ala Asp Thr Met His Leu Ser Ala Glu Glu His Gly Ala Tyr Leu Leu 
35 40 45 

ctg atg ttc aat tac tgg caa aca gga aag cca ata cct aaa aac agg 192 
Leu Met Phe Asn Tyr Trp Gin Thr Gly Lys Pro He Pro Lys Asn Arg 
50 " 55 60 

ctg gca aaa att gee cgt ctg act aac gag cga tgg get gat gtt gaa 240 
Leu Ala Lys He Ala Arg Leu Thr Asn Glu Arg Trp Ala Asp Val Glu 
65 70 75 80 

cca tec ttg cag gag ttt ttt tgc gat aac ggc gag gaa tgg gtg cat 288 
Pro Ser Leu Gin Glu Phe Phe Cys Asp Asn Gly Glu Glu Trp Val His 
85 90 95 

ctt egg att gag gaa gat ctg gca tea gtc agg gaa aaa tta acc aaa 336 
Leu Arg He Glu Glu Asp Leu Ala Ser Val Arg Glu Lys Leu Thr Lys 
100 105 110 

aaa tea gee gca gga aaa gca tct gtt cag gee aga aga age aga aag 384 
Lys Ser Ala Ala Gly Lys Ala Ser Val Gin Ala Arg Arg Ser Arg Lys 
115 120 125 

gaa gca gat gtt caa aca aaa caa gag aga aat tta aca ggt gtt caa 432 
Glu Ala Asp Val Gin Thr Lys Gin Glu Arg Asn Leu Thr Gly Val Gin 
130 135 140 

aca gat gtt gaa gtg gtg ttt gaa cat gat gtc aac aca aag gca act 480 
Thr Asp Val Glu Val Val Phe Glu His Asp Val Asn Thr Lys Ala Thr 
145 150 155 160 

aat aaa gat aca gat aaa gat eta aaa aca gat ccc ccc eta aat ccc 528 
Asn Lys Asp Thr Asp Lys Asp Leu Lys Thr Asp Pro Pro Leu Asn Pro 
165 170 175 
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ccc egg ggg aat cga ggt gtc aaa aag ttt gac cct ctg gat att act 576 

Pro Arg Gly Asn Arg Gly Val Lys Lys Phe Asp Pro Leu Asp He Thr 
180 185 * 190 

ttg ccg aac tgg att tct gtc teg ctt tgg cgt gag tgg gtt gaa ttt 624 

Leu Pro Asn Trp He Ser Val Ser Leu Trp Arg Glu Trp Val Glu Phe 

195 200 205 

cgc cag gca ttg cga aaa ccg att cga acg gag cag ggc get aac ggg 672 

Arg Gin Ala Leu Arg Lys Pro He Arg Thr Glu Gin Gly Ala Asn Gly 

210 215 220 

gcg ata egg gag ctg gaa aaa ttc cgc cag cag ggt ttt tea cct gag 720 

Ala lie Arg Glu Leu Glu Lys Phe Arg Gin Gin Gly Phe Ser Pro Glu 

225 230 235 240 

cag gtg att cga cac age ate gec aat gaa tac cag ggc ttg ttc gcg 768 

Gin Val He Arg His Ser He Ala Asn Glu Tyr Gin Gly Leu Phe Ala 
245 250 255 

ccg aaa ggt gtt cga cct gag acg tta etc cga cag gtt aac acc gtc 816 

Pro Lys Gly Val Arg Pro Glu Thr Leu Leu Arg Gin Val Asn Thr Val 
260 265 270 

teg tta ccg gat agt gcg ate ccg cca ggc ttc agg ggg taa 858 

Ser Leu Pro Asp Ser Ala He Pro Pro Gly Phe Arg Gly * 

275 280 285 



<210> 264 
<211> 747 
<212> DNA 

<213> Escherichia coli 

<220> 

<221> CDS 

<222> (1) . . . (747) 

<400> 264 

atg aaa aat att gcg aca ggc gat gtt ctt gaa cgt ate cgc aga ctg 48 

Met Lys Asn He Ala Thr Gly Asp Val Leu Glu Arg He Arg Arg Leu 
15 10 15 

gec ccg tea cat gta acc gcg cca ttc aag acg gta gcg gag tgg cgc 96 
Ala Pro Ser His Val Thr Ala Pro Phe Lys Thr Val Ala Glu Trp Arg 
20 25 30 

gag tgg caa ctt tec gaa ggc cag aaa cgt tgt gag gag ate aac cgt 144 
Glu Trp Gin Leu Ser Glu Gly Gin Lys Arg Cys Glu Glu He Asn Arg 
35 40 45 

cag aat cgt cag ttg egg gtg gaa aaa att ctg aat cgc tct ggc ate 192 
Gin Asn Arg Gin Leu Arg Val Glu Lys He Leu Asn Arg Ser Gly He 
50 55 60 

cag cca ttg cac cgc aaa tgc teg ttt teg aat tac cag gtg cag aac 240 
Gin Pro Leu His Arg Lys Cys Ser Phe Ser Asn Tyr Gin Val Gin Asn 
65 70 75 80 

gaa ggg cag cga tac gcg ttg agt cag gcg aaa tec ate get gat gaa 288 
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Glu Gly Gin Arg Tyr Ala Leu Ser Gin Ala Lys Ser lie Ala Asp Glu 
85 90 95 

ctg atg acc ggg tgt aca aat ttt gcg ttc age gga aaa cct ggt acc 336 
Leu Met Thr Gly Cys Thr Asn Phe Ala Phe Ser Gly Lys Pro Gly Thr 
100 105 110 

ggg aag aac cac tta gcg gca get ate ggg aat cgc ctg ctg aaa gac 384 
Gly Lys Asn His Leu Ala Ala Ala lie Gly Asn Arg Leu Leu Lys Asp 
115 120 125 

ggt cag aca gtg att gtg gtt acc gtg get gat gtt atg agt gee ctg 432 
Gly Gin Thr Val lie Val Val Thr Val Ala Asp Val Met Ser Ala Leu 
130 135 140 

cac gec age tat gac gat ggg cag tea ggc gaa aaa ttt ttg egg gaa 480 
His Ala Ser Tyr Asp Asp Gly Gin Ser Gly Glu Lys Phe Leu Arg Glu 
145 " 150 155 160 

ctg tgc gaa gtg gat ctg ctg gtt ctt gat gaa att ggc att cag cgc 528 
Leu Cys Glu Val Asp Leu Leu Val Leu Asp Glu lie Gly lie Gin Arg 
165 170 175 . 

gag acg aaa aac gag cag gtg gta ctg cac cag att gtt gat cgc egg 576 
Glu Thr Lys Asn Glu Gin Val Val Leu His Gin He Val Asp Arg Arg 
180 185 190 

aca gcg teg atg cgc age gtg ggg atg ctg aca aac ctg aac tat gag 624 
Thr Ala Ser Met Arg Ser Val Gly Met Leu Thr Asn Leu Asn Tyr Glu 
195 200 205 

gec atg aaa aca ttg etc ggc gag egg att atg gat cgc atg acc atg 672 
Ala Met Lys Thr Leu Leu Gly Glu Arg He Met Asp Arg Met Thr Met 
210 215 220 

aac ggc ggg cga tgg gtg aat ttt aac tgg gag age tgg cgt ccg aat 720 
Asn Gly Gly Arg Trp Val Asn Phe Asn Trp Glu Ser Trp Arg Pro Asn 
225 230 235 240 

gtc gtc cag cca gga att gcg aag taa .747 
Val Val Gin Pro Gly He Ala Lys * 
245 



<210> 265 
<211> 612 
<212> DNA 

<213> Escherichia coli 

<220> 

<221> CDS 

<222> (1) . (612) 

<400> 265 

atg teg tec age cag gaa ttg cga agt aat ttt tac egg gag aaa aat 48 
Met Ser Ser Ser Gin Glu Leu Arg Ser Asn Phe Tyr Arg Glu Lys Asn 
1 5 10 15 

tta atg gag act gtt ttt gac gca ctg aaa gca atg gga aaa gee aca 96 
Leu Met Glu Thr Val Phe Asp Ala Leu Lys Ala Met Gly Lys Ala Thr 
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20 



25 



30 



tec ata gaa ctt get gcg cga ctt gat ate agt cgt gaa gaa gtg ctg 
Ser He Glu Leu Ala Ala Arg Leu Asp He Ser Arg Glu Glu Val Leu 
35 40 45 



144 



aac gaa eta tgg gaa ctg aaa aag get ggt ttt gtt gat aaa age gcg 
Asn Glu Leu Trp Glu Leu Lys Lys Ala Gly Phe Val Asp Lys Ser Ala 
50 55 60 



192 



tac ace tgg cgt gtg get gat aac aat gtt cag cag gaa cag cca gcg 
Tyr Thr Trp Arg Val Ala Asp Asn Asn Val Gin Gin Glu Gin Pro Ala 
65 70 75 80 



240 



cag gca gaa ctg ccg gaa gaa ate ace aca gca aca gta gcg aaa ate 
Gin Ala Glu Leu Pro Glu Glu He Thr Thr Ala Thr Val Ala Lys He 
85 90 95 



288 



tea gag tgc gat tta acc gcg acg att gaa caa cga gga cca caa acg 
Ser Glu Cys Asp Leu Thr Ala Thr He Glu Gin Arg Gly Pro Gin Thr 
100 105 110 



336 



get gat gag ctg get aca ttg ttt ggt acc aca tea cgc aaa gtg get 
Ala Asp Glu Leu Ala Thr Leu Phe Gly Thr Thr Ser Arg Lys Val Ala 
115 120 125 



384 



tea acg ctg gca atg gca ate age aaa ggt cgt ctg att cgc gta aat 
Ser Thr Leu Ala Met Ala He Ser Lys Gly Arg Leu He Arg Val Asn 
130 135 * 140 



432 



cag ggc ggt aaa ttt cgt tac tgc ata ccg ggc gat aat tta cca gca 
Gin Gly Gly Lys Phe Arg Tyr Cys He Pro Gly Asp Asn Leu Pro Ala 
145 150 155 160 



480 



gag ccg aaa gca gca teg gta tct ccg etc tgg tta tct gca teg teg 
Glu Pro Lys Ala Ala Ser Val Ser Pro Leu Trp Leu Ser Ala Ser Ser 
165 170 175 



528 



tct gee tgt cat ggg gtg tta ate att acc gtg ata acg cca teg cct 
Ser Ala Cys His Gly Val Leu He He Thr Val He Thr Pro Ser Pro 
180 * 185 190 



576 



aca aag aac age gcg aca aaa atg cca gag aac tga 
Thr Lys Asn Ser Ala Thr Lys Met Pro Glu Asn * 
195 200 



612 



<210> 266 
<211> 306 
<212> DNA 

<213> Escherichia coli 



<220> 
<221> CDS 
<222> (1). 



(306) 



<400> 266 

atg cag atg cgt cag cgt gat gtt get gcg etc gat gca aaa tac acg 

Met Gin Met Arg Gin Arg Asp Val Ala Ala Leu Asp Ala Lys Tyr Thr 
1 5 10 15 



48 
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aag gag tta get gat gcg aaa get gaa aat gat get ctg cgt gat gat 96 
Lys Glu Leu Ala Asp Ala Lys Ala Glu Asn Asp Ala Leu Arg Asp Asp 
20 25 30 

gtt gee get ggt cgt cgt egg ttg cac ate aaa gca gtc tgt cag tea 144 
Val Ala Ala Gly Arg Arg Arg Leu His lie Lys Ala Val Cys Gin Ser 
35 "* 40 45 

gtg cgt gaa gee ace ace gee tec ggc gtg gat aat gca gee tec ccc 192 
Val Arg Glu Ala Thr Thr Ala Ser Gly Val Asp Asn Ala Ala Ser Pro 
50 55 60 

cga ctg gca gac acc get gaa egg gat tat ttc ace etc egg gaa cga 240 
Arg Leu Ala Asp Thr Ala Glu Arg Asp Tyr Phe Thr Leu Arg Glu Arg 
65 70 75 80 

ctg gta atg atg cag gee caa ctt gaa ggt get cag caa tac ata acc 288 
Leu Val Met Met Gin Ala Gin Leu Glu Gly Ala Gin Gin Tyr He Thr 
85 90 95 

gag cag tgt tta aag taa 306 
Glu Gin Cys Leu Lys * 
100 



<210> 267 
<211> 879 
<212> DNA 

<213> Escherichia coli 

<220> 

<221> CDS 

<222> (1) . . . (879) 

<400> 267 

atg aaa ctg gga ttt att ggc tta ggc att atg ggt aca ccg atg gee 48 
Met Lys Leu Gly Phe He Gly Leu Gly He Met Gly Thr Pro Met Ala 
15 10 15 

att aat ctg gcg cgt gee ggt cat caa tta cat gtc acg acc att gga 96 
He Asn Leu Ala Arg Ala Gly His Gin Leu His Val Thr Thr lie Gly 
20 25 30 

ccg gtt get gat gaa tta ctg tea ctg ggt gee gtc agt gtt gaa act 144 
Pro Val Ala Asp Glu Leu Leu Ser Leu Gly Ala Val Ser Val Glu Thr 
35 40 45 

get cgc cag gta acg gaa gca teg gac ate att ttt att atg gtg ccg 192 
Ala Arg Gin Val Thr Glu Ala Ser Asp He He Phe He Met Val Pro 
50 55 60 

gac aca cct cag gtt gaa gaa gtt ctg ttc ggt gaa aat ggt tgt acc 240 
Asp Thr Pro Gin Val Glu Glu Val Leu Phe Gly Glu Asn Gly Cys Thr 
65 70 75 80 

aaa gee teg ctg aag ggc aaa acc att gtt gat atg age tec att tec 288 
Lys Ala Ser Leu Lys Gly Lys Thr He Val Asp Met Ser Ser He Ser 
85 90 95 
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ccg att gaa act aag cgt ttc get cgt cag gtg aat gaa ctg ggc ggc 336 
Pro lie Glu Thr Lys Arg Phe Ala Arg Gin Val Asn Glu Leu Gly Gly 
100 105 110 

gat tat etc gat gcg cca gtc tec ggc ggt gaa ate ggt gcg cgt gaa 384 
Asp Tyr Leu Asp Ala Pro Val Ser Gly Gly Glu He Gly Ala Arg Glu 
115 120 125 

ggg acg ttg teg att atg gtt ggc ggt gat gaa gcg gta ttt gaa cgt 432 
Gly Thr Leu Ser He Met Val Gly Gly Asp Glu Ala Val Phe Glu Arg 
130 135 140 

gtt aaa ccg ctg ttt gaa ctg etc ggt aaa aat ate acc etc gtg ggc 4 80 

Val Lys Pro Leu Phe Glu Leu Leu Gly Lys Asn He Thr Leu Val Gly 
145 150 155 160 

ggt aac ggc gat ggt caa acc tgc aaa gtg gca aat cag att ate gtg 528 
Gly Asn Gly Asp Gly Gin Thr Cys Lys Val Ala Asn Gin lie He Val 
165 170 175 

gcg etc aat att gaa gcg gtt tct gaa gee ctg eta ttt get tea aaa 576 
Ala Leu Asn He Glu Ala Val Ser Glu Ala Leu Leu Phe Ala Ser Lys 
180 185 190 

gee ggt gcg gac ccg gta cgt gtg cgc cag gcg ctg atg ggc ggc ttt 624 
Ala Gly Ala Asp Pro Val Arg Val Arg Gin Ala Leu Met Gly Gly Phe 
195 200 205 

get tec tea cgt att ctg gaa gtt cat ggc gag cgt atg att aaa cgc 672 
Ala Ser Ser Arg He Leu Glu Val His Gly Glu Arg Met He Lys Arg 
210 215 220 

acc ttt aat ccg ggc ttc aaa ate get ctg cac cag aaa gat etc aac 720 
Thr Phe Asn Pro Gly Phe Lys He Ala Leu His Gin Lys Asp Leu Asn 
225 230 235 240 

ctg gca ctg caa agt gcg aaa gca ctt gcg ctg aac ctg cca aac act 768 
Leu Ala Leu Gin Ser Ala Lys Ala Leu Ala Leu Asn Leu Pro Asn Thr 
245 250 255 

gcg acc tgc cag gag tta ttt aat acc tgt gcg gca aac ggt ggc age 816 
Ala Thr Cys Gin Glu Leu Phe Asn Thr Cys Ala Ala Asn Gly Gly Ser 
260 265 270 

cag ttg gat cac tct gcg tta gtg cag gcg ctg gaa tta atg get aac 8 64 

Gin Leu Asp His Ser Ala Leu Val Gin Ala Leu Glu Leu Met Ala Asn 
275 280 285 

cat aaa ctg gee tga 879 
His Lys Leu Ala * 
290 



<210> 268 
<211> 279 
<212> DNA 

<213> Escherichia coli 

<220> 
<221> CDS 
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<222> (1) . . . (279) 
<400> 268 

atg aat egg cca gca ata tta aaa aag aaa gca gec aaa gat gtt get 48 
Met Asn Arg Pro Ala lie Leu Lys Lys Lys Ala Ala Lys Asp Val Ala 
15 10 15 



tea gta tta aaa ata ata ttt tta ttt tat ttg ttc etc ata get aga 
Ser Val Leu Lys He lie Phe Leu Phe Tyr Leu Phe Leu He Ala Arg 
20 25 30 

tta aaa caa cgt tat teg ata cgt gaa att aag agg gat tta tgg aac 
Leu Lys Gin Arg Tyr Ser He Arg Glu He Lys Arg Asp Leu Trp Asn 
35 40 45 

ate aga gaa aac tat tec age aac gcg get ata gcg aag ate tat tgc 
He Arg Glu Asn Tyr Ser Ser Asn Ala Ala He Ala Lys He Tyr Cys 
50 55 60 

cga aaa cgc aaa gee age gga cct gga aaa cat tta act att tta cct 
Arg Lys Arg Lys Ala Ser Gly Pro Gly Lys His Leu Thr He Leu Pro 
65 " 70 75 80 

tat gga tgg gtt egg ttc ata acg ttc cca att atg tga 
Tyr Gly Trp Val Arg Phe He Thr Phe Pro He Met * 
85 90 



96 



144 



192 



240 



279 



<210> 269 
<211> 1314 
<212> DNA 

<213> Escherichia coli 

<220> 

<221> CDS 

<222> (1) . . . (1314) 

<400> 269 

atg gtc ggc ggc ttt ttt att etc ggc ttg tct acc ttt agt att atg 

Met Val Gly Gly Phe Phe He Leu Gly Leu Ser Thr Phe Ser He Met 

1 5 . 10 15 

ctg gca att ate etc age gee ttt ttc att gee gcg gta atg gta tta 
Leu Ala He He Leu Ser Ala Phe Phe He Ala Ala Val Met Val Leu 
20 25 30 



ggc gga att gee gec ate atg tgg ttt ggt ttg caa tgt tac gcg ggg 
Gly Gly He Ala Ala He Met Trp Phe Gly Leu Gin Cys Tyr Ala Gly 
65 70 75 80 

tea ctg gee tgc ttg att ctg att ggc aaa ate tgg ccg gga ttt tta 
Ser Leu Ala Cys Leu He Leu He Gly Lys He Trp Pro Gly Phe Leu 



48 



96 



aac ggt get gcg ggc agt aaa tac ggt gtg cct ttt gee atg ate ctg 144 

Asn Gly Ala Ala Gly Ser Lys Tyr Gly Val Pro Phe Ala Met He Leu 
35 40 45 

cgt get tct tac ggt gta cgt ggt gca ctg ttt ccc gga tta tta agg 192 

Arg Ala Ser Tyr Gly Val Arg Gly Ala Leu Phe Pro Gly Leu Leu Arg 
50 55 60 



240 



288 
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85 90 95 

act etc ggt ggt gat ttc act ctg tta ggc ctt tct eta ccg ggc tta 336 

Thr Leu Gly Gly Asp Phe Thr Leu Leu Gly Leu Ser Leu Pro Gly Leu 
100 105 110 

att act ttc tta ate ttc tgg ctg gtc aac gtt ggt ata ggt ttt ggc 384 

He Thr Phe Leu He Phe Trp Leu Val Asn Val Gly He Gly Phe Gly 

115 120 125 

ggt ggc aaa gtt tta aat aaa ttc act gec att ctt aac ccg tgc ate 4 32 

Gly Gly Lys Val Leu Asn Lys Phe Thr Ala He Leu Asn Pro Cys He 

130 135 140 

tat ate gtt ttc ggc ggt atg gcg att tgg gcg att tea ctg gtc ggg 480 

Tyr He Val Phe Gly Gly Met Ala He Trp Ala He Ser Leu Val Gly 

145 150 155 160 

ate ggt cca ate ttt gac tac att ccg age ggt att cag aaa gca gaa 528 

He Gly Pro He Phe Asp Tyr He Pro Ser Gly He Gin Lys Ala Glu 
165 170 175 

aac ggt ggc ttc ctg ttc ctg gtg gtg att aac gcg gta gtt gcg gtc 576 

Asn Gly Gly Phe Leu Phe Leu Val Val He Asn Ala Val Val Ala Val 
180 185 190 

tgg gcg gca ccg gcg gtg age gca tec gac ttt acg caa aac gee cac 624 

Trp Ala Ala Pro Ala Val Ser Ala Ser Asp Phe Thr Gin Asn Ala His 

195 200 205 

teg ttt cgt gag cag gcg ctg ggg caa acg ctg ggt tta gtt gtg gec 672 

Ser Phe Arg Glu Gin Ala Leu Gly Gin Thr Leu Gly Leu Val Val Ala 

210 215 220 

tat att ctg ttt gcg gtc gec ggg gta tgt att att gee gga gec agt 720 

Tyr He Leu Phe Ala Val Ala Gly Val Cys He He Ala Gly Ala Ser 

225 230 235 240 

att cac tac ggc get gat ace tgg aac gtg ctg gat att gtt cag cgt 7 68 

He His Tyr Gly Ala Asp Thr Trp Asn Val Leu Asp He Val Gin Arg 
245 250 255 

tgg gac age ctg ttc gee teg ttc ttt gcg gta ctg gtt att ctg atg 816 

Trp Asp Ser Leu Phe Ala Ser Phe Phe Ala Val Leu Val He Leu Met 
260 265 270 

aca act ate tec act aac gcg ace ggt aat att att cca gee ggt tat 8 64 

Thr Thr He Ser Thr Asn Ala Thr Gly Asn He He Pro Ala Gly Tyr 

275 280 285 

cag att gec gec att gca ccg aca aaa ctg ace tat aaa aac ggc gta 912 

Gin He Ala Ala He Ala Pro Thr Lys Leu Thr Tyr Lys Asn Gly Val 

290 295 300 

ctg att gee agt att ate age ttg ctg ate tgc ccg tgg aaa tta atg 960 

Leu He Ala Ser He He Ser Leu Leu lie Cys Pro Trp Lys Leu Met 

305 310 315 320 

gaa aat cag gac age att tat ctt ttc etc gat att ate ggc gga atg 1008 

Glu Asn Gin Asp Ser He Tyr Leu Phe Leu Asp He He Gly Gly Met 
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325 330 335 

ctt ggt ccg gta att ggt gtc atg atg gcg cat tat ttt gtg gtg atg 1056 
Leu Gly Pro Val lie Gly Val Met Met Ala His Tyr Phe Val Val Met 
340 345 350 

cgc gga caa att aat ctt gat gaa ctg tat acc gca cct ggc gat tat 1104 
Arg Gly Gin lie Asn Leu Asp Glu Leu Tyr Thr Ala Pro Gly Asp Tyr 
355 360 365 

aaa tat tac gat aac ggt ttt aac etc act gcg ttt tea gta act ctg 1152 
Lys Tyr Tyr Asp Asn Gly Phe Asn Leu Thr Ala Phe Ser Val Thr Leu 
370 375 380 

gtg gec gtt att tta tct ctt ggc ggt aag ttt att cac ttt atg gaa 1200 
Val Ala Val lie Leu Ser Leu Gly Gly Lys Phe He His Phe Met Glu 
385 390 395 400 

ccg tta teg cgt gtt tea tgg ttt gtc ggc gtc ate gtc gee ttt gcg 1248 
Pro Leu Ser Arg Val Ser Trp Phe Val Gly Val He Val Ala Phe Ala 
405 410 415 

gec tac gee tta tta aag aaa cgt aca aca gca gaa aaa aca gga gag 1296 
Ala Tyr Ala Leu Leu Lys Lys Arg Thr Thr Ala Glu Lys Thr Gly Glu 
420 425 430 

caa aaa acc ata ggt taa 1314 
Gin Lys Thr He Gly * 
435 



<210> 270 
<211> 1395 
<212> DNA 

<213> Escherichia coli 

<220> 
<221> CDS 

<222> (1) . . . (1395) 
<400> 270 

gtg tta tta ctg gat gcg tgc teg caa atg tgc ccg tea ttc aga cga 48 
Met Leu Leu Leu Asp Ala Cys Ser Gin Met Cys Pro Ser Phe Arg Arg 
15 10 15 

ttc cag aca gtg ttt cat aat tec tec att ttt etc cct tat tgg ctg 96 
Phe Gin Thr Val Phe His Asn Ser Ser He Phe Leu Pro Tyr Trp Leu 
20 25 . 30 

get aca eta gta tea ttc cgc gaa acg ttt cag gaa gag aaa etc tta 144 
Ala Thr Leu Val Ser Phe Arg Glu Thr Phe Gin Glu Glu Lys Leu Leu 
35 40 45 

acg atg aaa ggt agt tat aaa tec cgt tgg gta ate gta ate gtg gtg 192 
Thr Met Lys Gly Ser Tyr Lys Ser Arg Trp Val He Val He Val Val 
50 ^ 55 60 

gtt ate gec gec ate gee gca ttc tgg ttc tgg caa ggc cgc aat gac 240 
Val He Ala Ala He Ala Ala Phe Trp Phe Trp Gin Gly Arg Asn Asp 
65 70 75 80 
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tec egg agt gca gee cca ggg gcg acg aaa caa gcg cag caa teg cca 288 

Ser Arg Ser Ala Ala Pro Gly Ala Thr Lys Gin Ala Gin Gin Ser Pro 
85 90 95 

gcg ggt ggt cga cgt ggt atg cgt tec ggc cca tta gee ccg gtt cag 336 

Ala Gly Gly Arg Arg Gly Met Arg Ser Gly Pro Leu Ala Pro Val Gin 

100 105 110 

gcg gcg ace gec gta gaa cag gca gtt ccg cgt tac etc acc ggg ctt 384 

Ala Ala Thr Ala Val Glu Gin Ala Val Pro Arg Tyr Leu Thr Gly Leu 
115 120 125 

ggc acc att acc gee get aat acc gtt acg gtg cgc age cgc gtg gac 432 

Gly Thr lie Thr Ala Ala Asn Thr Val Thr Val Arg Ser Arg Val Asp 
130 135 140 

ggc caa ctg ata gcg tta cat ttc cag gaa ggc cag cag gtc aaa gca 480 

Gly Gin Leu He Ala Leu His Phe Gin Glu Gly Gin Gin Val Lys Ala 
145 150 155 160 

ggc gat tta ctg gca gaa att gac ccc age cag ttc aaa gtt gca tta 528 

Gly Asp Leu Leu Ala Glu He Asp Pro Ser Gin Phe Lys Val Ala Leu 
165 170 175 

gca caa gee cag ggc caa ctg gca aaa gat aaa gee acg ctt gee aac 576 

Ala Gin Ala Gin Gly Gin Leu Ala Lys Asp Lys Ala Thr Leu Ala Asn 

180 185 190 

gee cgc cgt gac ctg gcg cgt tat caa caa ctg gca aaa acc aat etc 624 

Ala Arg Arg Asp Leu Ala Arg Tyr Gin Gin Leu Ala Lys Thr Asn Leu 
195 200 205 

gtt tec cgc cag gag ctg gat gee caa cag gcg ctg gtc agt gaa acc 672 

Val Ser Arg Gin Glu Leu Asp Ala Gin Gin Ala Leu Val Ser Glu Thr 
210 ~ 215 220 

gaa ggc acc att aag get gat gaa gca age gtt gee age gcg cag ctg 720 

Glu Gly Thr He Lys Ala Asp Glu Ala Ser Val Ala Ser Ala Gin Leu 
225 ~ 230 235 240 

caa etc gac tgg age egg att acc gca cca gtc gat ggt cgc gtt ggt 768 

Gin Leu Asp Trp Ser Arg He Thr Ala Pro Val Asp Gly Arg Val Gly 
245 250 255 

etc aag cag gtt gat gtt ggt aac caa ate tec agt ggt gat acc acc 816 

Leu Lys Gin Val Asp Val Gly Asn Gin He Ser Ser Gly Asp Thr Thr 

260 265 270 

ggg ate gtg gtg ate acc cag acg cat cct ate gat tta gtc ttt acc 864 

Gly He Val Val He Thr Gin Thr His Pro He Asp Leu Val Phe Thr 
275 280 285 

ctg ccg gaa age gat ate get acc gta gtg cag gcg cag aaa gee gga 912 

Leu Pro Glu Ser Asp He Ala Thr Val Val Gin Ala Gin Lys Ala Gly 
290 295 300 

aaa ccg ctg gtg gta gaa gee tgg gat cgc acc aac teg aag aaa tta 960 

Lys Pro Leu Val Val Glu Ala Trp Asp Arg Thr Asn Ser Lys Lys Leu 
305 310 315 320 
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agt gaa ggc acg ctg tta agt eta gat aac caa ate gat gec act acc 1008 
Ser Glu Gly Thr Leu Leu Ser Leu Asp Asn Gin lie Asp Ala Thr Thr 
325 330 335 

ggt acg att aaa gtg aaa gca cgc ttt aat aat cag gat gat gcg ctg 1056 
Gly Thr lie Lys Val Lys Ala Arg Phe Asn Asn Gin Asp Asp Ala Leu 
340 345 350 

ttt ccc aat cag ttt gtt aac gcg cgc atg tta gtc gac acc gaa caa 1104 
Phe Pro Asn Gin Phe Val Asn Ala Arg Met Leu Val Asp Thr Glu Gin 
355 360 365 

aac gec gta gtg ate cca aca gec gec ctg caa atg ggc aat gaa ggc 1152 
Asn Ala Val Val He Pro Thr Ala Ala Leu Gin Met Gly Asn Glu Gly 
370 375 380 

cat ttt gtc tgg gtg ctg aat age gaa aac aag gtc age aaa cat ctg 1200 
His Phe Val Trp Val Leu Asn Ser Glu Asn Lys Val Ser Lys His Leu 
385 390 395 400 

gtg acg ccg ggc att cag gac agt cag aaa gtg gtg ate cgt gca ggt 1248 
Val Thr Pro Gly He Gin Asp Ser Gin Lys Val Val He Arg Ala Gly 
405 410 415 

att tct gcg ggc gat cgc gtg gtg aca gac ggc att gat cgc ctg acc 1296 
He Ser Ala Gly Asp Arg Val Val Thr Asp Gly He Asp Arg Leu Thr 
420 425 430 

gaa ggg gcg aaa gtg gaa gtg gtg gaa gee cag age gee act act ccg 1344 
Glu Gly Ala Lys Val Glu Val Val Glu Ala Gin Ser Ala Thr Thr Pro 
435 440 445 

gaa gag aaa gec acc age cgc gaa tac gcg aaa aaa gga gca cgc tec 1392 
Glu Glu Lys Ala Thr Ser Arg Glu Tyr Ala Lys Lys Gly Ala Arg Ser 
450 455 460 

tga 1395 



<210> 271 
<211> 3123 
<212> DNA 

<213> Escherichia coli 

<220> 

<221> CDS 

<222> (1) . . . (3123) 

<400> 271 

atg cag gtg tta ccc ccg age age aca ggc ggc ccg teg cgc ctg ttt 48 
Met Gin Val Leu Pro Pro Ser Ser Thr Gly Gly Pro Ser Arg Leu Phe 
15 10 15 

att atg cgt cct gtg gec acc acg ctg ctg atg gtg gcg ate tta etc 96 
He Met Arg Pro Val Ala Thr Thr Leu Leu Met Val Ala He Leu Leu 
20 25 30 
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gcc ggg att ate ggt tat cgc gec ctg ccc gtt teg gcg ctg ccg gaa 14 4 

Ala Gly lie lie Gly Tyr Arg Ala Leu Pro Val Ser Ala Leu Pro Glu 
35 40 45 

gtg gac tat ccg acc att cag gtg gtc acg etc tac cca ggt gcc age 192 

Val Asp Tyr Pro Thr lie Gin Val Val Thr Leu Tyr Pro Gly Ala Ser 
50 55 60 

ccg gat gtc atg acc tct gcc gtt acc gcg ccg eta gaa cgc cag ttc 240 

Pro Asp Val Met Thr Ser Ala Val Thr Ala Pro Leu Glu Arg Gin Phe 

65 70 75 80 

ggg cag atg tct ggc ctg aaa cag atg teg teg caa agt tec ggc ggt 288 

Gly Gin Met Ser Gly Leu Lys Gin Met Ser Ser Gin Ser Ser Gly Gly 
85 90 95 

gcg tea gtt ate act ttg cag ttc cag eta aca tta ccg etc gat gtc 336 

Ala Ser Val lie Thr Leu Gin Phe Gin Leu Thr Leu Pro Leu Asp Val 
100 105 110 

gcc gag cag gaa gtg cag gcc gcg att aac get gcg acc aac ttg ttg 384 

Ala Glu Gin Glu Val Gin Ala Ala lie Asn Ala Ala Thr Asn Leu Leu 
115 120 125 

ccg age gat ctg cct aac ccg ccg gtt tac age aaa gtg aac ccg gca 432 

Pro Ser Asp Leu Pro Asn Pro Pro Val Tyr Ser Lys Val Asn Pro Ala 
130 *" 135 140 

gat ccg ccg ate atg acg etc gcc gtc acc tea acc gcc atg ccg atg 4 80 

Asp Pro Pro He Met Thr Leu Ala Val Thr Ser Thr Ala Met Pro Met 

145 150 155 160 

acg caa gtg gaa gat atg gtg gaa acc cgc gtc gcg cag aaa ate teg 528 

Thr Gin Val Glu Asp Met Val Glu Thr Arg Val Ala Gin Lys He Ser 
165 170 175 

cag att tec ggc gtc ggc ctg gtg acg ctt tec ggc ggt cag cgt ccg 57 6 

Gin He Ser Gly Val Gly Leu Val Thr Leu Ser Gly Gly Gin Arg Pro 
180 185 190 

get gtt cgc gtc aaa ctt aac get cag gcg att gcc gcc etc ggc ctg 624 

Ala Val Arg Val Lys Leu Asn Ala Gin Ala He Ala Ala Leu Gly Leu 
195 ' 200 205 

acc age gaa acc gtg cgc acc gcc att acc ggc get aac gtt aac teg 672 

Thr Ser Glu Thr Val Arg Thr Ala He Thr Gly Ala Asn Val Asn Ser 
210 215 220 

gca aaa ggt age etc gac ggc cct tec cgt gcg gtc acg ctt tec gcg 720 

Ala Lys Gly Ser Leu Asp Gly Pro Ser Arg Ala Val Thr Leu Ser Ala 

225 230 235 240 

aac gac cag atg caa tec gcc gaa gag tat cgc cag eta ate ate gcc 7 68 

Asn Asp Gin Met Gin Ser Ala Glu Glu Tyr Arg Gin Leu He He Ala 
245 250 255 

tac cag aac ggc gcg cca att cgt ctg ggc gat gtc gca act gta gag 816 

Tyr Gin Asn Gly Ala Pro He Arg Leu Gly Asp Val Ala Thr Val Glu 
260 265 270 
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caa ggt gca gaa aac age tgg etc ggc gcg tgg gcg aac aaa gaa cag 864 
Gin Gly Ala Glu Asn Ser Trp Leu Gly Ala Trp Ala Asn Lys Glu Gin 
275 280 285 

gee att gtg atg aat gtt cag cgc cag ccc ggt get aac att ate tec 912 
Ala lie Val Met Asn Val Gin Arg Gin Pro Gly Ala Asn lie lie Ser 
290 295 300 

ace gee gac age att egg cag atg ctg cca cag etc act gag agt ctg 960 
Thr Ala Asp Ser lie Arg Gin Met Leu Pro Gin Leu Thr Glu Ser Leu 
305 310 315 320 

ccg aaa teg gtg aag gtg aca gtg ctt tec gat cgc ace ace aat ate 1008 
Pro Lys Ser Val Lys Val Thr Val Leu Ser Asp Arg Thr Thr Asn lie 
325 330 335 

cgc gca tec gtc gat gat act cag ttt gaa ttg atg atg get ate gcg 1056 
Arg Ala Ser Val Asp Asp Thr Gin Phe Glu Leu Met Met Ala lie Ala 
340 345 350 

ctg gta gtc atg att ate tac ctg ttt ttg cgc aat att ccg gcg ace 1104 
Leu Val Val Met lie lie Tyr Leu Phe Leu Arg Asn lie Pro Ala Thr 
355 360 365 

ate att ccc ggt gtt get gta ccg ctg teg tta ate ggc act ttc gcg 1152 
lie lie Pro Gly Val Ala Val Pro Leu Ser Leu lie Gly Thr Phe Ala 
370 375 380 

gtt atg gtg ttt etc gat ttt tea ate aat aac ctg aca ctg atg gcg 1200 
Val Met Val Phe Leu Asp Phe Ser He Asn Asn Leu Thr Leu Met Ala 
385 390 395 400 

tta act ate gec acc gga ttc gtg gtc gat gac gee ate gtg gtg ate 1248 
Leu Thr He Ala Thr Gly Phe Val Val Asp Asp Ala He Val Val He 
405 410 415 

gaa aac att tec cgc tat ate gaa aaa ggc gaa aaa ccg ttg gcg gcg 1296 
Glu Asn He Ser Arg Tyr He Glu Lys Gly Glu Lys Pro Leu Ala Ala 
420 425 430 

gcg etc aag ggc gca ggt gaa ate ggc ttt acc att ate teg ctg acc 1344 
Ala Leu Lys Gly Ala Gly Glu lie Gly Phe Thr He He Ser Leu Thr 
435 440 445 

ttc tea ctg att gcg gtg ttg ate cca ctg ctg ttt atg ggc gat ate 1392 
Phe Ser Leu He Ala Val Leu He Pro Leu Leu Phe Met Gly Asp He 
450 455 460 

gtc ggg cga ctg ttc cgc gaa ttt get att acc ctg gcg gta gcg att 1440 
Val Gly Arg Leu Phe Arg Glu Phe Ala He Thr Leu Ala Val Ala He 
465 470 475 480 

ttg ate tea gcg gtg gtg teg ctg acc ctg aca ccg atg atg tgc gcg 1488 
Leu He Ser Ala Val Val Ser Leu Thr Leu Thr Pro Met Met Cys Ala 
485 490 495 

egg atg etc age cag gag teg ttg cgt aaa cag aac cgc ttc tec cgt 1536 
Arg Met Leu Ser Gin Glu Ser Leu Arg Lys Gin Asn Arg Phe Ser Arg 
500 505 510 
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gcc teg gaa aaa atg ttc gac agg ata ate gee gee tat ggt cgt gga 1584 
Ala Ser Glu Lys Met Phe Asp Arg lie He Ala Ala Tyr Gly Arg Gly 
515 520 525 



ctg gcg aaa gtg ctg aat cat ccg tgg ctg acc tta age gtg gca etc 
Leu Ala Lys Val Leu Asn His Pro Trp Leu Thr Leu Ser Val Ala Leu 



530 535 540 



1632 



age acg ctg ctg ctt age gtg ctg ctg tgg gtg ttc att ccg aaa ggt 1680 

Ser Thr Leu Leu Leu Ser Val Leu Leu Trp Val Phe He Pro Lys Gly 

545 550 555 560 

ttc ttc ccg gta cag gac aat ggc att att cag ggc act ttg cag gca 1728 

Phe Phe Pro Val Gin Asp Asn Gly He He Gin Gly Thr Leu Gin Ala 

565 570 575 

ccg caa tec age tec ttt gcc aat atg gcc cag cga caa cgc cag gtc 177 6 

Pro Gin Ser Ser Ser Phe Ala Asn Met Ala Gin Arg Gin Arg Gin Val 

580 585 590 

gcg gac gtg att ttg cag gat ccg gca gtg caa age ctg acc tea ttt 1824 

Ala Asp Val He Leu Gin Asp Pro Ala Val Gin Ser Leu Thr Ser Phe 

595 600 605 

gtt ggc gtt gat ggc act aac ccg teg ctg aac agt gca cgt tta caa 1872 

Val Gly Val Asp Gly Thr Asn Pro Ser Leu Asn Ser Ala Arg Leu Gin 

610 615 620 

ate aac etc aaa ccg ttg gat gaa cgt gat gat egg gtg caa aaa gtc 1920 

He Asn Leu Lys Pro Leu Asp Glu Arg Asp Asp Arg Val Gin Lys Val 

625 630 635 640 

ate gcc cgt ctg caa acg gcg gta gat aaa gtg ccg ggc gtc gat etc 1968 

He Ala Arg Leu Gin Thr Ala Val Asp Lys Val Pro Gly Val Asp Leu 

645 650 655 

ttc ctg caa cca acg cag gat ctg act att gat act cag gtc age cgc 2016 

Phe Leu Gin Pro Thr Gin Asp Leu Thr He Asp Thr Gin Val Ser Arg 

660 665 670 

acc cag tac cag ttt acc ttg cag gcc acg tea ctg gat gcg etc agt 2064 

Thr Gin Tyr Gin Phe Thr Leu Gin Ala Thr Ser Leu Asp Ala Leu Ser 

675 680 685 

acc tgg gtg cca cag ttg atg gaa aaa etc cag caa ctg cca cag ctt 2H2 

Thr Trp Val Pro Gin Leu Met Glu Lys Leu Gin Gin Leu Pro Gin Leu 

690 695 700 

tct gat gtc tec age gac tgg cag gac aaa ggg ctg gtg gcg tat gtc 2160 

Ser Asp Val Ser Ser Asp Trp Gin Asp Lys Gly Leu Val Ala Tyr Val 

705 710 715 720 

aat gtt gat cgc gac age gcc age cgt ctg ggg ate age atg gcg gat 2208 

Asn Val Asp Arg Asp Ser Ala Ser Arg Leu Gly He Ser Met Ala Asp 

725 730 735 

gtc gat aac gcc ctg tac aac gcg ttt ggt cag egg ctg att tec act 2256 

Val Asp Asn Ala Leu Tyr Asn Ala Phe Gly Gin Arg Leu He Ser Thr 

740 745 750 
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att tat act cag gcc aac cag tat cgc gtg gtg ctg gag cac aac acc 2304 
He Tyr Thr Gin Ala Asn Gin Tyr Arg Val Val Leu Glu His Asn Thr 
755 760 765 

gaa aat acc cca ggc etc gcg gcg ctg gat acc att cgc ctg acc age 2352 
Glu Asn Thr Pro Gly Leu Ala Ala Leu Asp Thr He Arg Leu Thr Ser 
770 775 780 

age gac ggc ggc gtg gtg ccg eta age tea att gcc aaa att gag cag 2400 
Ser Asp Gly Gly Val Val Pro Leu Ser Ser He Ala Lys He Glu Gin 
785 ~ 790 795 800 

cgt ttt gcg ccg etc tec ate aac cat ctg gat cag ttc ccg gta acg 2448 
Arg Phe Ala Pro Leu Ser He Asn His Leu Asp Gin Phe Pro Val Thr 
805 810 815 

acc ate tec ttt aac gtg ccg gat aac tat teg ctg ggc gat gcg gtg 24 96 
Thr He Ser Phe Asn Val Pro Asp Asn Tyr Ser Leu Gly Asp Ala Val 
820 825 830 

cag gcg att atg gac acc gaa aag acg ctg aat ctg ccg gtg gat ate 2544 
Gin Ala He Met Asp Thr Glu Lys Thr Leu Asn Leu Pro Val Asp He 
835 840 845 

acc acg cag ttc cag ggc age acc etc gcc ttc cag teg gcg ctg ggc 2592 
Thr Thr Gin Phe Gin Gly Ser Thr Leu Ala Phe Gin Ser Ala Leu Gly 
850 855 860 

age act gtc tgg ctg att gtc gcg gcg gtg gtg gcg atg tat ate gtg 2640 
Ser Thr Val Tip Leu He Val Ala Ala Val Val Ala Met Tyr He Val 
865 870 875 880 

etc ggc att ctg tac gag age ttt att cac ccg ate acc att etc teg 2688 
Leu Gly He Leu Tyr Glu Ser Phe He His Pro lie Thr He Leu Ser 
885 890 895 

acg eta ccc acc gca ggg gtt ggc gca ctg ctg gcg ttg ctg att get 2736 
Thr Leu Pro Thr Ala Gly Val Gly Ala Leu Leu Ala Leu Leu lie Ala 
900 905 910 

ggt age gaa ctg gat gtg att gcg att. ate ggc att att ttg ctg ate 2784 
Gly Ser Glu Leu Asp Val He Ala He He Gly lie lie Leu Leu lie 
915 920 925 

ggt ate gtg aag aag aac gcc ate atg atg ate gac ttc gcg ctg get 2832 
Gly lie Val Lys Lys Asn Ala lie Met Met lie Asp Phe Ala Leu Ala 
930 ~ 935 940 

get gag cgc gag caa ggc atg teg ccg cgc gag gca ate tac cag get 2880 
Ala Glu Arg Glu Gin Gly Met Ser Pro Arg Glu Ala He Tyr Gin Ala 
945 950 955 960 

tgt ctg ttg cgt ttt cgt ccg ate ctg atg acc act ctg gcg get ctg 2928 
Cys Leu Leu Arg Phe Arg Pro lie Leu Met Thr Thr Leu Ala Ala Leu 
965 970 975 

ctt ggc gcg ctg ccg ctg atg ttg agt acc ggg gtc ggc gcg gaa ctg 2976 
Leu Gly Ala Leu Pro Leu Met Leu Ser Thr Gly Val Gly Ala Glu Leu 
980 985 990 
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cgt cgt ccg tta ggt ate ggc atg gtc ggc ggt ctg att gtc age cag 3024 
Arg Arg Pro Leu Gly He Gly Met Val Gly Gly Leu He Val Ser Gin 
995 1000 1005 

gtg ctg acg ctg ttt acc acg ccg gtg att tat ttg ctg ttc gac cgc 3072 
Val Leu Thr Leu Phe Thr Thr Pro Val He Tyr Leu Leu Phe Asp Arg 
1010 1015 1020 

ctg gca ttg tgg acc aaa age cgc ttt gec cgt cat gaa gag gag gcg 3120 
Leu Ala Leu Trp Thr Lys Ser Arg Phe Ala Arg His Glu Glu Glu Ala 
1025 1030 1035 1040 

taa 3123 



<210> 272 
<211> 3078 
<212> DNA 

<213> Escherichia coli 



<220> 
<221> CDS 
<222> (1) . 



(3078) 



<400> 272 

gtg aag ttt ttt gee etc ttc att tac cgc ccg gtg gcg acg att tta 
Met Lys Phe Phe Ala Leu Phe He Tyr Arg Pro Val Ala Thr lie Leu 
15 10 15 



48 



ctg teg gtt gee att acc ctg tgc ggc ata ctg ggc ttc cgt atg ctg 
Leu Ser Val Ala He Thr Leu Cys Gly He Leu Gly Phe Arg Met Leu 
20 25 30 



96 



ccg gtc gec ccg ctg ccg cag gtc gat ttt ccg gtg att ate gtc age 
Pro Val Ala Pro Leu Pro Gin Val Asp Phe Pro Val He He Val Ser 
35 40 45 



144 



gee teg ctg ccc ggt gcg tea cca gaa aca atg gcg tct tec gtt gee 
Ala Ser Leu Pro Gly Ala Ser Pro Glu Thr Met Ala Ser Ser Val Ala 
50 55 60 



192 



acg ccg ctg gag cgc tea ctt ggg cgc att gee gga gtc agt gaa atg 
Thr Pro Leu Glu Arg Ser Leu Gly Arg He Ala Gly Val Ser Glu Met 
65 70 75 80 



240 



acc tec age agt teg etc ggc age acg cgt att att ttg cag ttt gat 
Thr Ser Ser Ser Ser Leu Gly Ser Thr Arg He He Leu Gin Phe Asp 
85 90 95 



288 



ttt gac egg gat ate aac ggc gca gcg cgt gat gtg cag gcg gcg ate 
Phe Asp Arg Asp He Asn Gly Ala Ala Arg Asp Val Gin Ala Ala He 
100 105 110 



336 



aac get gca caa agt ttg ctg ccc agt ggg atg ccc age cgc ccg acc 
Asn Ala Ala Gin Ser Leu Leu Pro Ser Gly Met Pro Ser Arg Pro Thr 
115 120 125 



384 



tat cgc aaa gcg aac ccg teg gat gcg cca att atg ate etc acg ctg 



432 
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Tyr Arg Lys Ala Asn Pro Ser Asp Ala Pro lie Met lie Leu Thr Leu 
130 ^ 135 140 

acg tec gat act tat teg cag ggt gaa ctg tac gat ttc gee teg acg 480 
Thr Ser Asp Thr Tyr Ser Gin Gly Glu Leu Tyr Asp Phe Ala Ser Thr 
145 150 155 160 

cag ctg get ccg acg att teg caa ate gac ggt gtt ggt gat gtc gat 528 
Gin Leu Ala Pro Thr lie Ser Gin lie Asp Gly Val Gly Asp Val Asp 
165 170 175 

gtc gga ggc age tea ctg ccc gee gta cgc gtc ggg ctg aat ccg cag 576 
Val Gly Gly Ser Ser Leu Pro Ala Val Arg Val Gly Leu Asn Pro Gin 
180 185 190 

gcg ctg ttt aat cag ggc gtg teg ctg gac gac gta cgc ace gee gtc 624 
Ala Leu Phe Asn Gin Gly Val Ser Leu Asp Asp Val Arg Thr Ala Val 
195 200 205 

age aat gee aac gtg cgt aaa ccg cag ggc gcg ctg gaa gat ggc act 672 
Ser Asn Ala Asn Val Arg Lys Pro Gin Gly Ala Leu Glu Asp Gly Thr 
210 215 220 

cac cgc tgg cag ate cag ace aat gat gag eta aaa ace gee get gaa 720 
His Arg Trp Gin He Gin Thr Asn Asp Glu Leu Lys Thr Ala Ala Glu 
225 230 235 240 

tat cag ccg ttg att att cac tac aac aac ggc ggc gcg gtt cgt ctg 7 68 

Tyr Gin Pro Leu He He His Tyr Asn Asn Gly Gly Ala Val Arg Leu 
245 250 255 

ggc gat gtg gcg acg gtg ace gac tea gtg cag gat gtg cgc aac gee 816 
Gly Asp Val Ala Thr Val Thr Asp Ser Val Gin Asp Val Arg Asn Ala 
260 265 270 

ggg atg ace aac gec aaa ccg get att tta ctg atg ate cgc aaa ctg 8 64 

Gly Met Thr Asn Ala Lys Pro Ala He Leu Leu Met He Arg Lys Leu 
275 280 285 

ccg gaa gee aat att ate cag acg gtt gac age ate egg gca aaa tta 912 
Pro Glu Ala Asn He He Gin Thr Val Asp Ser He Arg Ala Lys Leu 
290 295 300 

ccg gag ttg cag gaa acc att ccg gcg gcg att gat ctg caa att gec 960 
Pro Glu Leu Gin Glu Thr He Pro Ala Ala He Asp Leu Gin He Ala 
305 310 315 320 

cag gat cgc tec ccc acc att cgc gec teg ctg gaa gaa gtc gag caa 1008 
Gin Asp Arg Ser Pro Thr He Arg Ala Ser Leu Glu Glu Val Glu Gin 
325 330 335 

acg ctg att ate teg gtg gcg ctg gtg att ctg gtg gtg ttt tta ttc 1056 
Thr Leu He He Ser Val Ala Leu Val He Leu Val Val Phe Leu Phe 
340 345 - 350 

ctg cgc teg ggt cgc gee act att att ccc gec gtt teg gtg ccg gtt 1104 
Leu Arg Ser Gly Arg Ala Thr He He Pro Ala Val Ser Val Pro Val 
355 360 365 

teg ctg att ggt acg ttt gcg gcg atg tac ctg tgc gga ttc agt etc 1152 
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Ser Leu He Gly Thr Phe Ala Ala Met Tyr Leu Cys Gly Phe Ser Leu 

370 375 380 

aat aac ctt teg tta atg gcg etc acc ate get act ggt ttc gtg gtg 1200 

Asn Asn Leu Ser Leu Met Ala Leu Thr He Ala Thr Gly Phe Val Val 

385 390 395 400 

gat gac gec ate gtg gtg ctg gaa aac att gca cgt cat ctg gaa gcg 1248 

Asp Asp Ala He Val Val Leu Glu Asn He Ala Arg His Leu Glu Ala 
405 410 415 

gga atg aaa ccg ttg caa gec gca ctg caa ggt act cgc gaa gtc ggt 1296 

Gly Met Lys Pro Leu Gin Ala Ala Leu Gin Gly Thr Arg Glu Val Gly 
420 425 * 430 

ttt acg gtg ctg teg atg agt ctg tea ctg gtg gcg gtg ttc ctg ccg 1344 

Phe Thr Val Leu Ser Met Ser Leu Ser Leu Val Ala Val Phe Leu Pro 

435 440 445 

ctg ctg ttg atg ggc gga ttg ccg ggc cga ctg tta cgc gaa ttt gec 1392 

Leu Leu Leu Met Gly Gly Leu Pro Gly Arg Leu Leu Arg Glu Phe Ala 

450 455 460 

gtg acg ctt tct gtc gec att ggt ata teg ttg ctg gtt tct ctg aca 1440 

Val Thr Leu Ser Val Ala He Gly He Ser Leu Leu Val Ser Leu Thr 

465 470 475 480 

tta acg cca atg atg tgt ggc tgg atg ctg aaa gec age aag ccg cgc 1488 

Leu Thr Pro Met Met Cys Gly Trp Met Leu Lys Ala Ser Lys Pro Arg 
485 490 ~ 495 

gag caa aag cga ctg cgt ggt ttt ggt cgc atg ttg gta gec ctg caa 1536 

Glu Gin Lys Arg Leu Arg Gly Phe Gly Arg Met Leu Val Ala Leu Gin 
500 505 510 

caa ggc tac ggc aag tea eta aaa tgg gtg etc aat cat acc cgt ctg 1584 

Gin Gly Tyr Gly Lys Ser Leu Lys Trp Val Leu Asn His Thr Arg Leu 

515 520 525 

gtg ggc gtg gtg ctg ctt ggc acc att gcg ctg aat ate tgg ctg tat 1632 

Val Gly Val Val Leu Leu Gly Thr He Ala Leu Asn He Trp Leu Tyr 

530 535 540 

ate teg ate ccg aaa acc ttc ttc ccg gag cag gac act ggc gtg ttg 1680 

He Ser He Pro Lys Thr Phe Phe Pro Glu Gin Asp Thr Gly Val Leu 

545 550 555 560 

atg ggc ggg att cag gcg gat cag agt att teg ttt cag gcg atg cgc 1728 

Met Gly Gly He Gin Ala Asp Gin Ser He Ser Phe Gin Ala Met Arg 
565 570 575 

ggt aag ttg cag gat ttc atg aaa att ate cgt gac gat ccg gca gtg 1776 

Gly Lys Leu Gin Asp Phe Met Lys He He Arg Asp Asp Pro Ala Val 
580 585 590 

gat aat gtc acc ggc ttt aca ggc ggt teg cga gtg aac age ggg atg 1824 

Asp Asn Val Thr Gly Phe Thr Gly Gly Ser Arg Val Asn Ser Gly Met 

595 600 605 

atg ttt ate acc etc aag cca cgc gac gaa cgc age gaa acg gcg cag 1872 
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Met Phe lie Thr Leu Lys Pro Arg Asp Glu Arg Ser Glu Thr Ala Gin 
610 615 620 

caa att ate gac cgt ctg cgc gta aaa ctg gcg aaa gaa ccg ggg gcg 1920 
Gin He He Asp Arg Leu Arg Val Lys Leu Ala Lys Glu Pro Gly Ala 
625 630 635 640 

aat ctg ttc ctg atg gcg gta cag gat att cgc gtt ggt ggg cgt cag 1968 
Asn Leu Phe Leu Met Ala Val Gin Asp He Arg Val Gly Gly Arg Gin 
645 650 655 

teg aac gec age tac cag tac acg ttg tta tec gac gac ctg gcg gca 2016 
Ser Asn Ala Ser Tyr Gin Tyr Thr Leu Leu Ser Asp Asp Leu Ala Ala 
660 665 670 

ctg cga gaa tgg gag ccg aaa ate cgc aaa aaa ctg gcg acg ttg ccg 2064 
Leu Arg Glu Trp Glu Pro Lys He Arg Lys Lys Leu Ala Thr Leu Pro 
675 680 685 

gaa ctg gcg gac gtg aac tec gat cag cag gat aac ggc gcg gag atg 2112 
Glu Leu Ala Asp Val Asn Ser Asp Gin Gin Asp Asn Gly Ala Glu Met 
690 695 700 

aat ctg gtt tac gac cgc gac ace atg gca egg ctg gga ate gac gta 2160 
Asn Leu Val Tyr Asp Arg Asp Thr Met Ala Arg Leu Gly He Asp Val 
705 710 715 720 

caa gee gee aac agt ctg tta aat aac gec ttc ggt cag egg caa ate 2208 
Gin Ala Ala Asn Ser Leu Leu Asn Asn Ala Phe Gly Gin Arg Gin He 
725 730 735 

teg acc att tac cag ccg atg aac cag tat aaa gtg gtg atg gaa gtg 2256 
Ser Thr He Tyr Gin Pro Met Asn Gin Tyr Lys Val Val Met Glu Val 
740 745 750 

gat ccg cgc tat acc cag gac ate agt gcg ctg gaa aaa atg ttc gtt 2304 
Asp Pro Arg Tyr Thr Gin Asp He Ser Ala Leu Glu Lys Met Phe Val 
755 760 765 

ate aat aac gaa ggc aaa gcg ate ccg ctg teg tat ttc get aaa tgg 2352 
lie Asn Asn Glu Gly Lys Ala He Pro Leu Ser Tyr Phe Ala Lys Trp 
770 775 780 

caa ccg gcg aat gee cca eta teg gtg aat cat cag gga tta teg gcg 2400 
Gin Pro Ala Asn Ala Pro Leu Ser Val Asn His Gin Gly Leu Ser Ala 
785 790 795 800 

gee teg acc att teg ttt aac ctg ccg acc gga aaa teg etc teg gac 2448 
Ala Ser Thr He Ser Phe Asn Leu Pro Thr Gly Lys Ser Leu Ser Asp 
805 810 815 

gee agt gcg gcg ate gat cgc gca atg acc cag ctt ggt gtg cct teg 2496 
Ala Ser Ala Ala He Asp Arg Ala Met Thr Gin Leu Gly Val Pro Ser 
820 825 830 

acg gtg cgc ggc agt ttt gec ggc acg gcg cag gtg ttc cag gag acg 254 4 
Thr Val Arg Gly Ser Phe Ala Gly Thr Ala Gin Val Phe Gin Glu Thr . 
835 840 845 

atg aac teg cag gtg ate ctg att ate gec gee ate gec acg gtg tat 2592 
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Met Asn Ser Gin Val He Leu He He Ala Ala lie Ala Thr Val Tyr 

850 855 860 

ate gtg ctg ggt ate ctt tac gag agt tac gta cat ccg ctg acg att 2640 

He Val Leu Gly He Leu Tyr Glu Ser Tyr Val His Pro Leu Thr He 

865 870 875 880 

etc tec ace ctg ccc teg gcg ggc gtt gga gcg ctg ttg gcg ctg gag 2688 

Leu Ser Thr Leu Pro Ser Ala Gly Val Gly Ala Leu Leu Ala Leu Glu 
885 890 895 

ctg ttc aat gee ccg ttc age eta ate gec ctg ata ggg ate atg eta 2736 

Leu Phe Asn Ala Pro Phe Ser Leu He Ala Leu He Gly He Met Leu 

900 905 910 

tta ate ggc ate gtg aag aaa aac gee att atg atg gtc gat ttt gcg 2784 

Leu He Gly He Val Lys Lys Asn Ala He Met Met Val Asp Phe Ala 

915 920 925 

ctt gaa gee caa egg cac ggt aac ctg acg ccg cag gaa get att ttc 2832 

Leu Glu Ala Gin Arg His Gly Asn Leu Thr Pro Gin Glu Ala He Phe 

930 935 940 

cag gec tgt ctg ctg cgt ttt cgc ccg att atg atg act acc ctg gcg 2880 

Gin Ala Cys Leu Leu Arg Phe Arg Pro He Met Met Thr Thr Leu Ala 

945 950 955 960 

gcg ctg ttt ggt gcg ctg ccg ctg gta ttg teg ggc ggc gac ggc teg 2928 

Ala Leu Phe Gly Ala Leu Pro Leu Val Leu Ser Gly Gly Asp Gly Ser 
965 970 975 

gag ctg egg caa ccc ctg ggg ate acc att gtc ggc gga ctg gta atg 297 6 

Glu Leu Arg Gin Pro Leu Gly lie Thr He Val Gly Gly Leu Val Met 

980 985 990 

age cag etc ctt acg ctg tat acc acg ccg gtg gtg tat etc ttt ttc 3024 

Ser Gin Leu Leu Thr Leu Tyr Thr Thr Pro Val Val Tyr Leu Phe Phe 

995 1000 1005 

gac cgt ctg egg ctg cgt ttt teg cgt aaa cct aaa caa acg gta acc 3072 

Asp Arg Leu Arg Leu Arg Phe Ser Arg Lys Pro Lys Gin Thr Val Thr 

1010 1015 1020 

gag taa 3078 

Glu * 

1025 



<210> 273 
<211> 1416 
<212> DNA 

<213> Escherichia coli 

<220> 

<221> CDS 

<222> (1) . . . (1416) 

<400> 273 

atg aca gat ctt ccc gac age acc cgt tgg caa ttg tgg att gtg get 48 
Met Thr Asp Leu Pro Asp Ser Thr Arg Trp Gin Leu Trp He Val Ala 
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15 10 15 

ttc ggc ttc ttt atg cag teg ctg gac acc acc ate gta aac acc gec 96 
Phe Gly Phe Phe Met Gin Ser Leu Asp Thr Thr lie Val Asn Thr Ala 
20 25 30 

ctt ccc tea atg gcg caa age etc ggg gaa agt ccg ttg cat atg cac 144 
Leu Pro Ser Met Ala Gin Ser Leu Gly Glu Ser Pro Leu His Met His 
35 40 45 

atg gtc att gtc tct tat gtg ctg acc gtg gcg gtg atg ctg ccc gee 192 
Met Val He Val Ser Tyr Val Leu Thr Val Ala Val Met Leu Pro Ala 
50 55 60 

age ggc tgg ctg gcg gac aaa gtc ggc gtg cgc aat att ttc ttt acc 240 
Ser Gly Trp Leu Ala Asp Lys Val Gly Val Arg Asn He Phe Phe Thr 
65 70 75 80 

gec ate gtg ctg ttt act etc ggt tea ctg ttt tgc gcg ctt tec ggc 288 
Ala He Val Leu Phe Thr Leu Gly Ser Leu Phe Cys Ala Leu Ser Gly 
85 90 95 

acg ctg aac gaa ctg ttg ctg gca cgc gcg tta cag ggc gtt ggc ggc 336 
Thr Leu Asn Glu Leu Leu Leu Ala Arg Ala Leu Gin Gly Val Gly Gly- • 
100 105 110 

gcg atg atg gtg ccg gtc ggc aga ttg acg gtg atg aaa ate gta ccg 384 
Ala Met Met Val Pro Val Gly Arg Leu Thr Val Met Lys lie Val Pro 
115 120 125 

cgc gag caa tat atg gcg gcg atg acc ttt gtc acg tta ccc ggt cag 432 
Arg Glu Gin Tyr Met Ala Ala Met Thr Phe Val Thr Leu Pro Gly Gin 
130 135 140 

gtc ggt ccg ctg etc ggt ccg gcg etc ggc ggt ctg ctg gtg gag tac 480 
Val Gly Pro Leu Leu Gly Pro Ala Leu Gly Gly Leu Leu Val Glu Tyr 
145 150 155 160 

gca teg tgg cac tgg ate ttt ttg ate aac att ccg gtg ggg att ate 528 
Ala Ser Trp His Trp He Phe Leu He Asn He Pro Val Gly He He 
165 170 175 

ggt gcg ate gec aca ttg ctg tta atg ccg aac tac acc atg cag acg 57 6 

Gly Ala He Ala Thr Leu Leu Leu Met Pro Asn Tyr Thr Met Gin Thr 
180 185 190 

egg cgc ttt gat etc tec gga ttt tta ttg ctg gcg gtt ggc atg gcg 624 
Arg Arg Phe Asp Leu Ser Gly Phe Leu Leu Leu Ala Val Gly Met Ala 
195 " 200 205 

gta tta acc ctg gcg ctg gac ggc agt aaa ggt aca ggt tta teg ccg 672 
Val Leu Thr Leu Ala Leu Asp Gly Ser Lys Gly Thr Gly Leu Ser Pro 
210 215 220 

ctg acg att gca ggc ctg gtc gca gtt ggc gtg gtg gca ctg gtg ctt 720 
Leu Thr He Ala Gly Leu Val Ala Val Gly Val Val Ala Leu Val Leu 
225 230 235 240 

tat ctg ctg cac gec aga aat aac aac cgt gec ctg ttc agt ctg aaa 7 68 

Tyr Leu Leu His Ala Arg Asn Asn Asn Arg Ala Leu Phe Ser Leu Lys 
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245 250 255 

ctg ttc cgt act cgt acc ttt teg ctg ggc ctg gcg ggg age ttt gee 816 
Leu Phe Arg Thr Arg Thr Phe Ser Leu Gly Leu Ala Gly Ser Phe Ala 
260 265 270 

gga cgt att ggc agt ggc atg ttg ccc ttt atg aca ccg gtt ttc ctg 864 
Gly Arg lie Gly Ser Gly Met Leu Pro Phe Met Thr Pro Val Phe Leu 
275 280 285 

caa att ggc etc ggt ttc teg ccg ttt cat gec gga ctg atg atg ate 912 
Gin He Gly Leu Gly Phe Ser Pro Phe His Ala Gly Leu Met Met He 
290 295 300 

ccg atg gtg ctt ggc age atg gga atg aag cga att gtg gta cag gtg 960 
Pro Met Val Leu Gly Ser Met Gly Met Lys Arg He Val Val Gin Val 
305 310 315 320 

gtg aat cgc ttt ggt tat cgt egg gta ctg gta gcg acc acg ctg ggt 1008 
Val Asn Arg Phe Gly Tyr Arg Arg Val Leu Val Ala Thr Thr Leu Gly 
325 330 335 

ctg teg ctg gtc acc ctg ttg ttt atg act acc gee ctg ctg ggc tgg 1056 
Leu Ser Leu Val Thr Leu Leu Phe Met Thr Thr Ala Leu Leu Gly Trp 
340 345 350 

tac tac gtt ttg ccg ttc gtc ctg ttt tta caa ggg atg gtc aac teg 1104 
Tyr Tyr Val Leu Pro Phe Val Leu Phe Leu Gin Gly Met Val Asn Ser 
355 360 365 

acg cgt ttc tec tec atg aac acc ctg acg ctg aaa gat etc ccg gac 1152 
Thr Arg Phe Ser Ser Met Asn Thr Leu Thr Leu Lys Asp Leu Pro Asp 
370 375 380 

aat ctg gcg age age ggc aac age ctg ctg teg atg att atg caa ttg 1200 
Asn Leu Ala Ser Ser Gly Asn Ser Leu Leu Ser Met He Met Gin Leu 
385 390 395 400 

teg atg agt ate ggc gtc act ate gec ggg ctg ttg ctg gga ctt ttt 1248 
Ser Met Ser He Gly Val Thr He Ala Gly Leu Leu Leu Gly Leu Phe 
405 410 415 

ggt tea cag cat gtc age gtc gac age ggc acc aca caa acc gtc ttt 1296 
Gly Ser Gin His Val Ser Val Asp Ser Gly Thr Thr Gin Thr Val Phe 
420 425 430 

atg tac acc tgg ctt age atg gcg ttg ate ate gec ctt ccg gcg ttc 1344 
Met Tyr Thr Trp Leu Ser Met Ala Leu He He Ala Leu Pro Ala Phe 
435 440 445 

ate ttt gee aga gtg ccg aac gat acg cat caa aat gta get att teg 1392 
He Phe Ala Arg Val Pro Asn Asp Thr His Gin Asn Val Ala He Ser 
450 455 460 

egg cga aaa agg age gcg caa tga 1416 
Arg Arg Lys Arg Ser Ala Gin * 
465 470 
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<211> 417 
<212> DNA 

<213> Escherichia coli 

<220> 

<221> CDS 

<222> (1) . . . (417) 

<400> 274 

gtg att gcg ttt gaa gtg ttg aag tgg gca ggc gcg get tac ttg att 48 
Met He Ala Phe Glu Val Leu Lys Trp Ala Gly Ala Ala Tyr Leu He 
15 10 15 

tgg ctg gga ate cag cag tgg cgc gec get ggt gca att gac ctt aaa 96 
Trp Leu Gly He Gin Gin Trp Arg Ala Ala Gly Ala He Asp Leu Lys 
20 25 30 

teg ctg gee tct act caa teg cgt cga cat ttg ttc cag cgc gca gtt 14 4 

Ser Leu Ala Ser Thr Gin Ser Arg Arg His Leu Phe Gin Arg Ala Val 
35 40 45 

ttt gtg aat etc acc aat ccc aaa agt att gtg ttt ctg gcg gcg eta 192 
Phe Val Asn Leu Thr Asn Pro Lys Ser He Val Phe Leu Ala Ala Leu 
50 55 60 

ttt ccg caa ttc ate atg ccg caa cag ccg caa ctg atg cag tat ate 240 
Phe Pro Gin Phe He Met Pro Gin Gin Pro Gin Leu Met Gin Tyr He 
65 70 75 80 

gtg etc ggc gtc acc act att gtg gtc gat att att gtg atg ate ggt 288 
Val Leu Gly Val Thr Thr He Val Val Asp He He Val Met He Gly 
85 90 95 

tac gee acc ctt get caa egg att get eta tgg att aaa gga cca aag 336 
Tyr Ala Thr Leu Ala Gin Arg He Ala Leu Trp He Lys Gly Pro Lys 
100 105 110 

cag atg aag gcg ctg aat aag att ttc ggc teg ttg ttt atg ctg gtg 384 
Gin Met Lys Ala Leu Asn Lys He Phe Gly Ser Leu Phe Met Leu Val 
115 120 125 

gga gcg ctg tta gca teg gcg agg cat gcg tga 417 
Gly Ala Leu Leu Ala Ser Ala Arg His Ala * 
130 135 



<210> 275 
<211> 774 
<212> DNA 

<213> Escherichia coli 

<220> 

<221> CDS 

<222> (1) . . . (774) 

<400> 275 

atg get cgt aaa tgg ttg aac ttg ttt gee ggg gcg gca etc tct ttc 4 8 

Met Ala Arg Lys Trp Leu Asn Leu Phe Ala Gly Ala Ala Leu Ser Phe 
15 10 15 
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get gtt get ggc aat gca ctg gca gat gaa ggg aaa ate acg gtg ttc 96 

Ala Val Ala Gly Asn Ala Leu Ala Asp Glu Gly Lys lie Thr Val Phe 

20 25 30 

gee gee gca tea ctg act aac gca atg cag gac att get acg cag ttt 14 4 

Ala Ala Ala Ser Leu Thr Asn Ala Met Gin Asp lie Ala Thr Gin Phe 

35 40 45 

aaa aaa gag aaa ggc gtg gat gtg gtt tct tct ttc get teg tea tct 192 

Lys Lys Glu Lys Gly Val Asp Val Val Ser Ser Phe Ala Ser Ser Ser 

50 55 60 

act etc gee cgt cag att gaa gcg ggt gcg cct gcg gat ctg ttt att 24 0 

Thr Leu Ala Arg Gin lie Glu Ala Gly Ala Pro Ala Asp Leu Phe lie 

65 70 75 80 

tct gee gat cag aaa tgg atg gat tat gcg gtt gat aaa aaa gcg ate 288 

Ser Ala Asp Gin Lys Trp Met Asp Tyr Ala Val Asp Lys Lys Ala lie 

85 90 95 

gat aca get acg cgt cag aca ctg etc ggc aat age ctg gtc gtt gta 336 

Asp Thr Ala Thr Arg Gin Thr Leu Leu Gly Asn Ser Leu Val Val Val 

100 105 110 

gca ccg aaa gee age gtg cag aaa gat ttc ace ate gac age aaa ace 384 

Ala Pro Lys Ala Ser Val Gin Lys Asp Phe Thr lie Asp Ser Lys Thr 

115 120 " 125 

aac tgg act tea ctg ctg aat ggc ggt cgc ctg gcg gtt ggc gat ccg 432 

Asn Trp Thr Ser Leu Leu Asn Gly Gly Arg Leu Ala Val Gly Asp Pro 

130 135 140 

gaa cat gtt ccc get ggc att tat gca aaa gaa gca ctg caa aaa ctg 480 

Glu His Val Pro Ala Gly lie Tyr Ala Lys Glu Ala Leu Gin Lys Leu 

145 150 155 160 

ggc gca tgg gat acg etc tct ccg aaa ctg gee cca gcg gaa gat gtt 528 

Gly Ala Trp Asp Thr Leu Ser Pro Lys Leu Ala Pro Ala Glu Asp Val 

165 170 175 

cgt ggg gcg ctg gcg ctg gtc gaa cgt aac gaa gcg cct ctg ggc att 576 

Arg Gly Ala Leu Ala Leu Val Glu Arg Asn Glu Ala Pro Leu Gly lie 

180 185 190 

gtc tac ggt tct gac gca gtt gee age aaa ggg gta aaa gtg gtt gee 624 

Val Tyr Gly Ser Asp Ala Val Ala Ser Lys Gly Val Lys Val Val Ala 

195 200 205 

acc ttc ccg gaa gat tea cat aaa aaa gtg gaa tat ccg gtt get gtt 672 

Thr Phe Pro Glu Asp Ser His Lys Lys Val Glu Tyr Pro Val Ala Val 

210 215 220 

gtg gaa ggg cat aac aat gcg aca gtg aaa gca ttt tat gat tat ctg 720 

Val Glu Gly His Asn Asn Ala Thr Val Lys Ala Phe Tyr Asp Tyr Leu 

225 ~ 230 235 240 

aag gga ccg cag gca gcg gaa ate ttt aaa cgt tac gga ttt aca ate 768 

Lys Gly Pro Gin Ala Ala Glu lie Phe Lys Arg Tyr Gly Phe Thr lie 

245 250 255 
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aag taa 774 
Lys * 



<210> 276 
<211> 690 
<212> DNA 

<213> Escherichia coli 

<220> 

<221> CDS 

<222> (1) . . . (690) 

<400> 276 

atg ata ctg acc gat cca gaa tgg cag gca gtt tta tta age ctg aaa 48 
Met He Leu Thr Asp Pro Glu Trp Gin Ala Val Leu Leu Ser Leu Lys 
15 10 15 

gtt tct tec ctg get gtg ctg ttt age ctg ccg ttt ggg ate ttt ttt 96 
Val Ser Ser Leu Ala Val Leu Phe Ser Leu Pro Phe Gly lie Phe Phe 
20 25 30 

gec tgg tta ctg gtg cgt tgc acg ttt ccg ggc aaa get ctg etc gac 144 
Ala Trp Leu Leu Val Arg Cys Thr Phe Pro Gly Lys Ala Leu Leu Asp 
35 40 45 

age gta ctg cat eta ccg ctg gtg tta ccg ccc gtg gtc gtc ggt tac 192 
Ser Val Leu His Leu Pro Leu Val Leu Pro Pro Val Val Val Gly Tyr 
50 55 60 

tta tta tta gtt teg atg gga egg cgc gga ttt ate ggt gaa cgt ctg 240 
Leu Leu Leu Val Ser Met Gly Arg Arg Gly Phe He Gly Glu Arg Leu 
65 70 75 80 

tat gac tgg ttt ggt att acc ttc gee ttt age tgg cgc ggc gcg gtt 288 
Tyr Asp Trp Phe Gly He Thr Phe Ala Phe Ser Trp Arg Gly Ala Val 
85 90 95 

etc get gec gee gtc atg teg ttt ccg ctg atg gtg egg gca att cgt 336 
Leu Ala Ala Ala Val Met Ser Phe Pro Leu Met Val Arg Ala He Arg 
100 105 110 

ctg gcg ctg gaa ggg gtt gat gtc aaa ctg gaa cag gec gca aga aca 384 
Leu Ala Leu Glu Gly Val Asp Val Lys Leu Glu Gin Ala Ala Arg Thr 
115 ~ 120 125 

ctg ggg gec ggg cgc tgg cgc gtt ttc ttt act ate acg tta ccg ctg 432 
Leu Gly Ala Gly Arg Trp Arg Val Phe Phe Thr lie Thr Leu Pro Leu 
130 " 135 140 

acc tta ccg gga att att gtt ggt acg gta ctg get ttt get cgt tct 480 
Thr Leu Pro Gly He He Val Gly Thr Val Leu Ala Phe Ala Arg Ser 
145 150 155 160 

etc ggt gag ttt ggt gca acc ate acc ttt gtg teg aac att cct ggt 528 
Leu Gly Glu Phe Gly Ala Thr He Thr Phe Val Ser Asn He Pro Gly 
165 170 175 

gaa acg cga acc att cct tct gee atg tat acc ctg ate cag acc ccc 576 



-251- 



WO 01/34810 



PCTAJSOO/30950 



Glu Thr Arg Thr lie Pro Ser Ala Met Tyr Thr Leu lie Gin Thr Pro 
180 185 190 

ggc ggg gaa agt gga gcg gcg aga ctg tgc att att tct att gcg ctg 624 
Gly Gly Glu Ser Gly Ala Ala Arg Leu Cys lie lie Ser lie Ala Leu 
195 200 205 

gcg atg ate tec ctg ttg att tea gaa tgg ctg gee aga ate age cgt 672 
Ala Met lie Ser Leu Leu lie Ser Glu Trp Leu Ala Arg lie Ser Arg 
210 215 220 



gaa egg gcg ggg cgc taa 690 

Glu Arg Ala Gly Arg * 

225 



<210> 277 
<211> 1059 
<212> DNA 

<213> Escherichia coli 

<220> 

<221> CDS 

<222> (1) . . . (1059) 

<400> 277 

atg ctg gaa ctg aat ttt tec cag acg ttg ggc aac cat tgc ctg act 48 

Met Leu Glu Leu Asn Phe Ser Gin Thr Leu Gly Asn liis Cys Leu Thr 
1 5 10 ^ 15 

att aat gaa acg ctg ccc gee aat ggc ate act get ate ttt ggc gtc 96 
lie Asn Glu Thr Leu Pro Ala Asn Gly lie Thr Ala lie Phe Gly Val 
20 25 30 

tec ggt gee gga aaa act teg ctg att aac gee ate agt gga ctg acg 144 
Ser Gly Ala Gly Lys Thr Ser Leu lie Asn Ala lie Ser Gly Leu Thr 
35 40 45 

cgc ccg caa aaa ggg egg att gtc etc aat ggg egg gta eta aat gat 192 
Arg Pro Gin Lys Gly Arg lie Val Leu Asn Gly Arg Val Leu Asn Asp 
50 55 60 

gee gaa aaa ggt ate tgc ctg acg ccg gaa aag cgt cgc gtt ggc tat 240 
Ala Glu Lys Gly lie Cys Leu Thr Pro Glu Lys Arg Arg Val Gly Tyr 
65 70 75 80 

gtt ttt cag gat gcg egg ctg ttc ccg cat tac aaa gtg cgt ggc aat 288 
Val Phe Gin Asp Ala Arg Leu Phe Pro His Tyr Lys Val Arg Gly Asn 
85 90 95 

ctg cgc tac ggc atg teg aaa agt atg gtc gat cag ttc gat aag ctg 336 
Leu Arg Tyr Gly Met Ser Lys Ser Met Val Asp Gin Phe Asp Lys Leu 
100 105 110 

gtg gcg ctt tta ggc att gaa ccg ttg ctt gae cgt tta cca ggc age 384 
Val Ala Leu Leu Gly lie Glu Pro Leu Leu Asp Arg Leu Pro Gly Ser 
115 120 125 

ctg tec gga ggc gaa aaa cag cgc gtg gcg att: ggt egg get ttg ctg 4 32 

Leu Ser Gly Gly Glu Lys Gin Arg Val Ala lie Gly Arg Ala Leu Leu 
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130 135 140 

aca gca ccg gaa ttg ctg ttg ctg gat gaa ccg ctg gcg tea ctg gat 480 
Thr Ala Pro Glu Leu Leu Leu Leu Asp Glu Pro Leu Ala Ser Leu Asp 
145 150 155 160 

att ccg cgt aaa cgc gaa ctg ttg cct tat ctg caa egg ctg aca egg 528 
lie Pro Arg Lys Arg Glu Leu Leu Pro Tyr Leu Gin Arg Leu Thr Arg 
165 170 175 

gaa ate aac att ccg atg ttg tat gtc age cat teg ctg gat gag ate 576 
Glu lie Asn lie Pro Met Leu Tyr Val Ser His Ser Leu Asp Glu lie 
180 185 190 

etc cat ctg gca gac aga gtg atg gta ctg gaa aac ggt cag gtg aaa 624 
Leu His Leu Ala Asp Arg Val Met Val Leu Glu Asn Gly Gin Val Lys 
195 200 205 

gee ttt ggc gcg ctg gag gaa gtg tgg ggc agt age gtg atg aat ccg 672 
Ala Phe Gly Ala Leu Glu Glu Val Trp Gly Ser Ser Val Met Asn Pro 
210 215 220 

tgg ctg ccg aaa gag caa caa agt age att ctg aaa gtg acg gtg ctt 720 
Trp Leu Pro Lys Glu Gin Gin Ser Ser He Leu Lys Val Thr Val Leu 
225 230 235 240 

gag cat cat ccg cat tac gcg atg acc gcg ctg gcg ctg ggc gat cag 768 
Glu His His Pro His Tyr Ala Met Thr Ala Leu Ala Leu Gly Asp Gin 
245 250 255 

cat ttg tgg gtc aat aag ctg gac gaa ccg ctg caa get gcg eta cgt 816 
His Leu Trp Val Asn Lys Leu Asp Glu Pro Leu Gin Ala Ala Leu Arg 
260 265 270 

ate cgc att cag get tec gat gtt tct ctg gtt tta caa ccg ccg cag 864 
lie Arg He Gin Ala Ser Asp Val Ser Leu Val Leu Gin Pro Pro Gin 
275 280 285 

caa acc age att cgt aac gta ttg egg gca aaa gtt gtt aat agt tat 912 
Gin Thr Ser He Arg Asn Val Leu Arg Ala Lys Val Val Asn Ser Tyr 
290 295 300 

gac gac aac ggc cag gtg gaa gtg gaa ctg gaa gtc ggc ggt aaa acg 960 
Asp Asp Asn Gly Gin Val Glu Val Glu Leu Glu Val Gly Gly Lys Thr 
305 310 315 320 

ctg tgg gcg cgt ate age ccg tgg gec agg gat gaa ctg gcg ate aaa 1008 
Leu Trp Ala Arg He Ser Pro Trp Ala Arg Asp Glu Leu Ala He Lys 
325 330 335 

cct ggc ctg tgg ctg tac gcg caa- att aaa agt gtg teg ata acc gee 1056 
Pro Gly Leu Trp Leu Tyr Ala Gin He Lys Ser Val Ser He Thr Ala 
340 345 350 



tga 



1059 



<210> 278 
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<211> 507 
<212> DNA 

<213> Escherichia coli 

<220> 

<221> CDS 

<222> (1) . . . (507) 

<400> 278 

gtg aac tec gtc att aca caa aaa gtg tea tct ggc gtt aca ctt tat 48 

Met Asn Ser Val lie Thr Gin Lys Val Ser Ser Gly Val Thr Leu Tyr 
15 10 15 

gcg gat act aaa aca gga ggt ttt atg aac aga acg att ctt gtc cct 96 
Ala Asp Thr Lys Thr Gly Gly Phe Met Asn Arg Thr lie Leu Val Pro 
20 25 30 

ate gat att tec gat tea gaa tta act caa cgc gtg att age cac gtt 144 
lie Asp lie Ser Asp Ser Glu Leu Thr Gin Arg Val lie Ser His Val 
35 40 45 

gag gaa gag gca aag att gat gat gca gag gtt cat ttc ctg acg gta 192 
Glu Glu Glu Ala Lys lie Asp Asp Ala Glu Val His Phe Leu Thr Val 
50 55 60 

ata cct tea ctg ccc tac tat gee tct ctg ggt tta gcg tat tec gca 240 
lie Pro Ser Leu Pro Tyr Tyr Ala Ser Leu Gly Leu Ala Tyr Ser Ala 
65 70 75 80 

gaa tta ccg gca atg gat gac ctg aaa gcg gaa gec aaa teg caa ctg 288 
Glu Leu Pro Ala Met Asp Asp Leu Lys Ala Glu Ala Lys Ser Gin Leu 
85 90 95 

gaa gag ate att aaa aaa ttt aaa ctg cca ace gac aga gtg cat gtc 336 
Glu Glu lie lie Lys Lys Phe Lys Leu Pro Thr Asp Arg Val His Val 
100 105 110 

cat gtt gag gaa ggc teg ccc aaa gac cgc att ctg gaa ttg gcg aag 38 4 

His Val Glu Glu Gly Ser Pro Lys Asp Arg lie Leu Glu Leu Ala Lys 
115 120 125 

aag ate ccc get cat atg ate ate att get tec cat cga ccg gat ate 432 
Lys lie Pro Ala His Met lie lie lie Ala Ser His Arg Pro Asp lie 
130 135 140 

acc act tat ctg etc ggt tec aac gec gca get gta gtg cgt cac gca 480 
Thr Thr Tyr Leu Leu Gly Ser Asn Ala Ala Ala Val Val Arg His Ala 
145 150 155 160 

gag tgc tec gtg ctg gtt gtg cgc tga 507 
Glu Cys Ser Val Leu Val Val Arg * 
165 



<210> 279 
<211> 1134 
<212> DNA 

<213> Escherichia coli 
<220> 
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<221> CDS 

<222> (1) . . . (1134) 

<400> 279 

atg aaa age aaa gta ctg gca ctt tta att cct gec ctg etc gee gca 48 
Met Lys Ser Lys Val Leu Ala Leu Leu lie Pro Ala Leu Leu Ala Ala 
15 10 15 

ggt get gca cat gca gee gaa gtt tat aat aaa gac ggc aac aaa tta 96 
Gly Ala Ala His Ala Ala Glu Val Tyr Asn Lys Asp Gly Asn Lys Leu 
20 25 30 



gat ctg tat ggc aaa gtt gat ggc ctg cat tat ttt tct gat aat tea 
Asp Leu Tyr Gly Lys Val Asp Gly Leu His Tyr Phe Ser Asp Asn Ser 
35 40 45 



ctg cct gaa ttt ggc ggt gac tct tat ace aat gca gac aac ttt atg 
Leu Pro Glu Phe Gly Gly Asp Ser Tyr Thr Asn Ala Asp Asn Phe Met 
130 135 140 



ggt ctg gta aat ggt ctg aac ttc gcg gtg cag tat caa ggt aac aac 
Gly Leu Val Asn Gly Leu Asn Phe Ala Val Gin Tyr Gin Gly Asn Asn 
165 170 175 



cgc cat gaa aac ggt gac ggc tgg ggt ctt tec aca aca tat gat tta 
Arg His Glu Asn Gly Asp Gly Trp Gly Leu Ser Thr Thr Tyr Asp Leu 
195 200 205 

ggc atg ggc ttt age get ggt gcg gca tac ace tct tct gac cgc ace 
Gly Met Gly Phe Ser Ala Gly Ala Ala Tyr Thr Ser Ser Asp Arg Thr 
210 215 220 



144 



gcg aaa gat ggc gac cag age tat gcg cgt ctg ggt ttt aaa ggc gaa 192 
Ala Lys Asp Gly Asp Gin Ser Tyr Ala Arg Leu Gly Phe Lys Gly Glu 
50 ~ 55 60 

ace caa att aac gat caa etc act ggc tac ggt caa tgg gaa tac aat 240 
Thr Gin He Asn Asp Gin Leu Thr Gly Tyr Gly Gin Trp Glu Tyr Asn 
65 70 75 80 

att cag gca aac aac act gaa tct tea aaa aac cag tea tgg ace cgt 288 
He Gin Ala Asn Asn Thr Glu Ser Ser Lys Asn Gin Ser Trp Thr Arg 
85 90 95 

ctg gca ttt gee ggg ctg aaa ttt gca gat tac ggt tct ttc gat tac 336 
Leu Ala Phe Ala Gly Leu Lys Phe Ala Asp Tyr Gly Ser Phe Asp Tyr 
100 105 HO 

gga cgt aat tat ggc gta atg tac gac ate gaa ggc tgg ace gat atg 384 
Gly Arg Asn Tyr Gly Val Met Tyr Asp He Glu Gly Trp Thr Asp Met 
115 120 125 



432 



act ggt cga gee aat ggc gtc gcg act tat cgt aat act gat ttc ttc 480 
Thr Gly Arg Ala Asn Gly Val Ala Thr Tyr Arg Asn Thr Asp Phe Phe 
145 150 155 160 



528 



gaa gga gee agt aat ggt cag gaa ggc ace aac aac gga cgt gat gtt 576 
Glu Gly Ala Ser Asn Gly Gin Glu Gly Thr Asn Asn Gly Arg Asp Val 
180 185 190 



624 



672 
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aat gac cag gtt aac cat act gcg gcg ggt ggt gat aaa gca gac gcg 720 
Asn Asp Gin Val Asn His Thr Ala Ala Gly Gly Asp Lys Ala Asp Ala 
225 230 235 240 

tgg act get ggg eta aaa tac gat get aac aat att tac ctg gca ace 7 68 

Trp Thr Ala Gly Leu Lys Tyr Asp Ala Asn Asn lie Tyr Leu Ala Thr 
245 250 255 

atg tat tea gaa acg cgt aat atg acc ccg ttt ggc gac age gat tat 816 
Met Tyr Ser Glu Thr Arg Asn Met Thr Pro Phe Gly Asp Ser Asp Tyr 
260 265 270 

get gtg gca aac aaa acc cag aat ttt gaa gtc act gca cag tac cag 864 
Ala Val Ala Asn Lys Thr Gin Asn Phe Glu Val Thr Ala Gin Tyr Gin 
275 280 285 

ttt gat ttt ggt ctg cgt ccg gca gtc tct ttc ctg atg tct aaa ggc 912 
Phe Asp Phe Gly Leu Arg Pro Ala Val Ser Phe Leu Met Ser Lys Gly 
290 295 300 

cgt gac ctg cac get gcg ggt ggt gca gac aac ccg gca ggt gtt gat 960 
Arg Asp Leu His Ala Ala Gly Gly Ala Asp Asn Pro Ala Gly Val Asp 
305 * 310 315 320 

gat aaa gat ctg gtt aaa tac gec gat att ggc gcg act tac tat ttc 1008 
Asp Lys Asp Leu Val Lys Tyr Ala Asp lie Gly Ala Thr Tyr Tyr Phe 
325 330 335 

aat aaa aac atg tec acc tac gtt gac tat aaa ate aac ctg ttg gat 1056 
Asn Lys Asn Met Ser Thr Tyr Val Asp Tyr Lys lie Asn Leu Leu Asp 
340 345 350 

gaa gat gac age ttc tac get gee aat ggc ate tct acc gat gat att 1104 
Glu Asp Asp Ser Phe Tyr Ala Ala Asn Gly lie Ser Thr Asp Asp lie 
355 360 365 

gtc get tta ggt ctg gtt tat cag ttc taa 1134 
Val Ala Leu Gly Leu Val Tyr Gin Phe * 
370 375 



<210> 280 
<211> 3369 
<212> DNA 

<213> Escherichia coli 

<220> 

<221> CDS 

<222> (1) . . . (3369) 

<400> 280 

atg tta agg gtc tac cat tec aat cgt ctg gac gtg ctg gaa gcg ttg 4 8 

Met Leu Arg Val Tyr His Ser Asn Arg Leu Asp Val Leu Glu Ala Leu 
15 10 15 

atg gag ttt att gtc gaa cgc gaa egg ctg gac gat cct ttc gaa cca 96 
Met Glu Phe lie Val Glu Arg Glu Arg Leu Asp Asp Pro Phe Glu Pro 
20 25 30 

gag atg att ctg gtg caa agt acc ggt atg gca cag tgg ctg caa atg 14 4 
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Glu Met lie Leu Val Gin Ser Thr Gly Met Ala Gin Trp Leu Gin Met 
35 40 45 

acc ctg teg caa aag ttt ggt att gcg gca aac att gat ttt ccg ctg 192 
Thr Leu Ser Gin Lys Phe Gly He Ala Ala Asn He Asp Phe Pro Leu 
50 55 60 



cca gcg age ttt ate tgg gat atg ttc gtc egg gtg tta ccg gaa ate 
Pro Ala Ser Phe He Trp Asp Met Phe Val Arg Val Leu Pro Glu He 
65 70 75 80 



cag gcg eta cag gcg ctg ggt aaa cat att gaa ate cat etc ctg ttt 
Gin Ala Leu Gin Ala Leu Gly Lys His He Glu He His Leu Leu Phe 
225 230 235 240 



240 



ccc aaa gag age gec ttt aac aaa cag age atg age tgg aaa ctg atg 288 
Pro Lys Glu Ser Ala Phe Asn Lys Gin Ser Met Ser Trp Lys Leu Met 
85 90 95 

act ctg ctg ccg caa ttg ctg gag cgc gaa gac ttt acc ctg ttg egg 336 
Thr Leu Leu Pro Gin Leu Leu Glu Arg Glu Asp Phe Thr Leu Leu Arg 
100 105 110 

cat tat ctg act gac gat age gac aag cga aaa ctg ttc cag ctt tec 384 
His Tyr Leu Thr Asp Asp Ser Asp Lys Arg Lys Leu Phe Gin Leu Ser 
115 120 125 

tea aaa gcg gcg gac ctg ttt gac cag tat ctg gtc tat cgt ccg gac 432 
Ser Lys Ala Ala Asp Leu Phe Asp Gin Tyr Leu Val Tyr Arg Pro Asp 
130 135 140 

tgg ctg gca cag tgg gaa aca gga cat ttg gtt gaa ggg ctg gga gaa 480 
Trp Leu Ala Gin Trp Glu Thr Gly His Leu Val Glu Gly Leu Gly Glu 
145 150 155 160 

gca cag gec tgg caa gec ccg ttg tgg aag gcg ctg gtg gaa tat acc 528 
Ala Gin Ala Trp Gin Ala Pro Leu Trp Lys Ala Leu Val Glu Tyr Thr 
165 170 175 

cat caa etc ggg caa ccg cgc tgg cac cgc gec aat etc tat cag cgc 57 6 

His Gin Leu Gly Gin Pro Arg Trp His Arg Ala Asn Leu Tyr Gin Arg 
180 185 190 

ttt ate gaa acg ctg gag tec gcg acg acc tgc ccg ccg ggg tta cct 624 
Phe He Glu Thr Leu Glu Ser Ala Thr Thr Cys Pro Pro Gly Leu Pro 
195 200 205 

teg cgc gtc ttt ata tgc ggt att tec gcg tta ccg cct gtt tat etc 672 
Ser Arg Val Phe He Cys Gly He Ser Ala Leu Pro Pro Val Tyr Leu 
210 "* 215 220 



720 



acc aac ccc tgc cgt tat tac tgg ggc gat att aaa gat cct get tat 768 
Thr Asn Pro Cys Arg Tyr Tyr Trp Gly Asp He Lys Asp Pro Ala Tyr 
245 250 255 

ctg gcg aaa ctg ctg acc cgt cag cgt cga cac agt ttt gaa gat cgc 816 
Leu Ala Lys Leu Leu Thr Arg Gin Arg Arg His Ser Phe Glu Asp Arg 
260 265 270 



gaa ttg ccg eta ttt cgt gat age gaa aat gec ggg cag etc ttt aac 



864 
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Glu Leu Pro Leu Phe Arg Asp Ser Glu Asn Ala Gly Gin Leu Phe Asn 
275 280 285 

age gat ggt gaa cag gat gtc ggc aac ccg ctg ctg get tea tgg ggt 912 
Ser Asp Gly Glu Gin Asp Val Gly Asn Pro Leu Leu Ala Ser Trp Gly 
290 295 300 

aag ctt ggg cgc gac tac att tat etc ctt tct gac ctg gag age age 960 
Lys Leu Gly Arg Asp Tyr lie Tyr Leu Leu Ser Asp Leu Glu Ser Ser 
305 310 315 320 

cag gag ctg gac gee ttt gtc gat gtc acg cca gat aac ctg ctg cat 1008 
Gin Glu Leu Asp Ala Phe Val Asp Val Thr Pro Asp Asn Leu Leu His 
325 330 335 

aac att cag tct gac att ctg gaa ctg gaa aac cgc gee gtt get ggt 1056 
Asn lie Gin Ser Asp lie Leu Glu Leu Glu Asn Arg Ala Val Ala Gly 
340 345 350 

gtg aac ate gaa gag ttt tec cgt age gat aac aaa cgc ccg ctt gat 1104 
Val Asn lie Glu Glu Phe Ser Arg Ser Asp Asn Lys Arg Pro Leu Asp 
355 360 365 

cca ctg gat age agt ate acc ttc cac gtt tgc cat age ccg cag cgt 1152 
Pro Leu Asp Ser Ser lie Thr Phe His Val Cys His Ser Pro Gin Arg 
370 375 380 

gaa gtt gaa gtt tta cac gat cgc ctg ctg gcg atg ctg gag gaa gac 1200 
Glu Val Glu Val Leu His Asp Arg Leu Leu Ala Met Leu Glu Glu Asp 
385 390 395 400 

ccg aca ctt act ccg cgc gac ate ate gtg atg gtg get gat ate gac 1248 
Pro Thr Leu Thr Pro Arg Asp lie lie Val Met Val Ala Asp He Asp 
405 ~ 410 * 415 

age tac agt ccg ttt att cag get gtg ttt ggt age gca cct gcg gat 1296 
Ser Tyr Ser Pro Phe He Gin Ala Val Phe Gly Ser Ala Pro Ala Asp 
420 425 430 

cgt tac eta cct tac gee att tec gac cgt cgt gcg egg cag tea cat 1344 
Arg Tyr Leu Pro Tyr Ala He Ser Asp Arg Arg Ala Arg Gin Ser His 
435 440 445 

ccg gta ctg gaa gcg ttt ate age ctg tta tea ctg cct gac agt cgt 1392 
Pro Val Leu Glu Ala Phe He Ser Leu Leu Ser Leu Pro Asp Ser Arg 
450 455 460 

ttt gtg tea gag gat gtg ctg gcg ttg ctg gat gtg ccg gtg ctg gcg 1440 
Phe Val Ser Glu Asp Val Leu Ala Leu Leu Asp Val Pro Val Leu Ala 
465 470 475 480 

gcg egg ttt gac ate acc gaa gaa ggg ctg cgt tat tta cgc cag tgg 1488 
Ala Arg Phe Asp He Thr Glu Glu Gly Leu Arg Tyr Leu Arg Gin Trp 
485 490 495 

gtc aac gaa tec ggc att cgt tgg ggc ata gat gac gac aac gtt cgc 1536 
Val Asn Glu Ser Gly lie Arg Trp Gly He Asp Asp Asp Asn Val Arg 
500 ~ " 505 510 

gag ctg gaa etc ccc gee acc gga caa cac acc tgg cga ttt ggc ctg 1584 
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Glu Leu Glu Leu Pro Ala Thr Gly Gin His Thr Trp Arg Phe Gly Leu 
515 520 525 

acg cgt atg ttg ttg ggc tac gcg atg gag age gcg cag ggc gag tgg 1632 
Thr Arg Met Leu Leu Gly Tyr Ala Met Glu Ser Ala Gin Gly Glu Trp 
530 535 540 

caa teg gtt eta cct tat gat gaa teg age ggc tta att gca gaa ctg 1680 
Gin Ser Val Leu Pro Tyr Asp Glu Ser Ser Gly Leu lie Ala Glu Leu 
545 550 555 560 

gtg ggg cat ctg get tea ctg eta atg cag eta aac ate tgg cgt cgc 1728 
Val Gly His Leu Ala Ser Leu Leu Met Gin Leu Asn lie Trp Arg Arg 
565 570 575 

ggg ctg gca cag gag cgt ccg ctg gaa gag tgg ttg ccg gtt tgt cgc 1776 
Gly Leu Ala Gin Glu Arg Pro Leu Glu Glu Trp Leu Pro Val Cys Arg 
580 585 590 

gat atg etc aac gee ttc ttc ctg ccg gat gcg gaa ace gaa gcg gcg 1824 
Asp Met Leu Asn Ala Phe Phe Leu Pro Asp Ala Glu Thr Glu Ala Ala 
595 600 605 

atg acg ctg ate gaa caa caa tgg cag gcg att ate gec gaa ggt tta 1872 
Met Thr Leu He Glu Gin Gin Trp Gin Ala lie He Ala Glu Gly Leu 
610 615 620 

ggt gcg cag tat ggc gac gcg gtg ccg ctg tea eta ttg cgt gat gaa 1920 
Gly Ala Gin Tyr Gly Asp Ala Val Pro Leu Ser Leu Leu Arg Asp Glu 
625 630 635 640 

ctg gca cag cgt ctg gat caa gaa cgt ate age cag cgt ttt etc gec 1968 
Leu Ala Gin Arg Leu Asp Gin Glu Arg He Ser Gin Arg Phe Leu Ala 
645 650 655 

gga ccg gtt aac att tgt act ctg atg cca atg cgt tea att ccg ttc 2016 
Gly Pro Val Asn He Cys Thr Leu Met Pro Met Arg Ser He Pro Phe 
660 665 670 

aaa gtg gtt tgc ctg ctg gga atg aac gac ggc gtt tat cca cgt cag 2064 
Lys Val Val Cys Leu Leu Gly Met Asn Asp Gly Val Tyr Pro Arg Gin 
675 680 685 

ctt gcg cca ttg ggc ttt gac ctg atg age cag aaa ccg aag cgt ggc 2112 
Leu Ala Pro Leu Gly Phe Asp Leu Met Ser Gin Lys Pro Lys Arg Gly 
690 695 700 

gac cgt age cgt cgc gat gac gac cgc tat ctg ttc ctg gaa gcg tta 2160 
Asp Arg Ser Arg Arg Asp Asp Asp Arg Tyr Leu Phe Leu Glu Ala Leu 
705 710 715 720 

att tec gcg cag caa aaa etc tat ate age tat ate ggt cgt tec att 2208 
He Ser Ala Gin Gin Lys Leu Tyr He Ser Tyr He Gly Arg Ser He 
725 730 735 

cag gat aac agt gag cgt ttc ccg teg gta ctg gtg cag gaa ctg ate 2256 
Gin Asp Asn Ser Glu Arg Phe Pro Ser Val Leu Val Gin Glu Leu lie 
740 745 750 

gac tac ate ggg caa agt cat tat eta ccg ggc gat gaa gcg etc aac 2304 
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Asp Tyr lie Gly Gin Ser His Tyr Leu Pro Gly Asp Glu Ala Leu Asn 
755 760 765 

tgt gat gaa age gag gca agg gta aaa gcg cat ctt act tgc etc cat 2352 
Cys Asp Glu Ser Glu Ala Arg Val Lys Ala His Leu Thr Cys Leu His 
770 775 780 

acc egg atg ccg ttt gat cca caa aac tac cag cca ggc gaa cga caa 2400 
Thr Arg Met Pro Phe Asp Pro Gin Asn Tyr Gin Pro Gly Glu Arg Gin 
785 790 795 800 

age tat get cgt gaa tgg eta cct gcg gee age cag get ggt aaa gca 2448 
Ser Tyr Ala Arg Glu Trp Leu Pro Ala Ala Ser Gin Ala Gly Lys Ala 
805 810 815 

cat tct gaa ttt gtt cag ccg ctg ccg ttt acc tta ccg gaa acc gtg 24 96 

His Ser Glu Phe Val Gin Pro Leu Pro Phe Thr Leu Pro Glu Thr Val 
820 825 830 

ccg ctg gaa acg eta caa cga ttc tgg gca cat ccg gtg egg gca ttt 254 4 

Pro Leu Glu Thr Leu Gin Arg Phe Trp Ala His Pro Val Arg Ala Phe 
835 840 845 

ttc cag atg cgt ttg cag gtg aac ttc cgt act gaa gac age gaa ate 2592 
Phe Gin Met Arg Leu Gin Val Asn Phe Arg Thr Glu Asp Ser Glu lie 
850 855 860 

ccc gac acc gag cca ttt att ctg gaa gga ctt age cgt tat caa ate 2640 
Pro Asp Thr Glu Pro Phe He Leu Glu Gly Leu Ser Arg Tyr Gin He 
865 870 875 880 

aat cag cag tta ttg aat gca ctg gtt gag cag gat gat gee gaa cgc 2688 
Asn Gin Gin Leu Leu Asn Ala Leu Val Glu Gin Asp Asp /Via Glu Arg 
885 890 895 

ttg ttc cgc cgc ttc cga gcg gca ggg gat tta ccg tat ggc get ttt 2736 
Leu Phe Arg Arg Phe Arg Ala Ala Gly Asp Leu Pro Tyr Gly Ala Phe 
900 " 905 ~ 910 

ggt gaa att ttc tgg gaa aca cag tgc cag gag atg cag cag ctt gee 2784 
Gly Glu He Phe Trp Glu Thr Gin Cys Gin Glu Met Gin Gin Leu Ala 
915 920 925 

gac aga gtc att gee tgt cgc cag ccg ggg cag agt atg gaa att gat 2832 

Asp Arg Val He Ala Cys Arg Gin Pro Gly Gin Ser Met Glu He Asp 
930 935 940 

< 

etc gee tgc aac ggt gtg cag ata act ggc tgg ttg ccg cag gtg cag 2880 

Leu Ala Cys Asn Gly Val Gin He Thr Gly Trp Leu Pro Gin Val Gin 
945 950 955 960 

ccg gat ggc ctg ttg cgc tgg cgt ccc tct tta tta agt gtg gee cag 2928 
Pro Asp Gly Leu Leu Arg Trp Arg Pro Ser Leu Leu Ser Val Ala Gin 
965 970 975 

gga atg caa ctt tgg ctg gaa cac ctt gtc tac tgt gec age ggt ggt 297 6 

Gly Met Gin Leu Trp Leu Glu His Leu Val Tyr Cys Ala Ser Gly Gly 
980 985 990 

aat ggt gaa agt cgc ctt ttt eta cgc aaa gac ggc gag tgg cgt ttt 3024 
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Asn Gly Glu Ser Arg Leu Phe Leu Arg Lys Asp Gly Glu Trp Arg Phe 
995 1000 1005 

ccg ccg ctt gca gcc gaa cag get ttg cat tac etc tea caa ctg att 3072 
Pro Pro Leu Ala Ala Glu Gin Ala Leu His Tyr Leu Ser Gin Leu lie 
1010 1015 1020 

gag ggg tat cgt gaa gga atg tec gcg cca ttg ctg gtg tta cct gaa 3120 
Glu Gly Tyr Arg Glu Gly Met Ser Ala Pro Leu Leu Val Leu Pro Glu 
1025 1030 1035 1040 

agt ggc ggc gcg tgg eta aaa acc tgt tat gac gcg caa aac gat gcc 3168 
Ser Gly Gly Ala Trp Leu Lys Thr Cys Tyr Asp Ala Gin Asn Asp Ala 
1045 1050 1055 

atg ctg gat gac gat tec acg ttg caa aaa gcc cgt acg aaa ttc ctt 3216 
Met Leu Asp Asp Asp Ser Thr Leu Gin Lys Ala Arg Thr Lys Phe Leu 
1060 1065 1070 

cag get tac gaa ggc aac atg atg gtg cgt ggc gaa ggt gat gat ate 3264 
Gin Ala Tyr Glu Gly Asn Met Met Val Arg Gly Glu Gly Asp Asp lie 
1075 1080 1085 

tgg tat caa agg etc tgg egg caa tta aca cca gag aca atg gag gcc 3312 
Trp Tyr Gin Arg Leu Trp Arg Glri Leu Thr Pro Glu Thr Met Glu Ala 
1090 1095 1100 

ate gtt gaa cag teg caa cgt ttc ctg tta ccg ctg ttt cgc ttt aat 3360 
lie Val Glu Gin Ser Gin Arg Phe Leu Leu Pro Leu Phe Arg Phe Asn 
1105 1110 1115 1120 

cag tea tga 3369 
Gin Ser * 



<210> 281 
<211> 324 
<212> DNA 

<213> Escherichia coli 

<220> 

<221> CDS 

<222> (1) . . . (324) 

<400> 281 

atg tea get tec ctg aag aat caa caa ggc ttt age ctg ccg gag gta 48 
Met Ser Ala Ser Leu Lys Asn Gin Gin Gly Phe Ser Leu Pro Glu Val 

1 5 10 - 15 . 

atg ttg gcg atg gtg ttg atg gtg atg att gtc act gcg tta teg ggt 96 
Met Leu Ala Met Val Leu Met Val Met lie Val Thr Ala Leu Ser Gly 
20 25 30 

ttc cag cga aca tta atg aac agt ctt gcc age aga aac cag tac caa 144 
Phe Gin Arg Thr Leu Met Asn Ser Leu Ala Ser Arg Asn Gin Tyr Gin 
35 40 45 

cag etc tgg egg cat ggc tgg cag caa acg caa ctg cgc gcg att teg 192 
Gin Leu Trp Arg His Gly Trp Gin Gin Thr Gin Leu Arg Ala lie Ser 
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50 55 60 

cca cct gcc aac tgg cag gtc aac cga atg cag aca teg cag gcg gga 240 
Pro Pro Ala Asn Trp Gin Val Asn Arg Met Gin Thr Ser Gin Ala Gly 
65 70 75 80 

tgt gtc age ate age gtt acg eta gtt tea ccc ggg ggc aga gaa ggc 288 
Cys Val Ser He Ser Val Thr Leu Val Ser Pro Gly Gly Arg Glu Gly 
85 90 95 

gag atg ace cgc ctg cat tgc ccg aat cgt cag tag 324 
Glu Met Thr Arg Leu His Cys Pro Asn Arg Gin * 
100 105 



<210> 282 
<211> 366 
<212> DNA 

<213> Escherichia coli 

<220> 

<221> CDS 

<222> (1) . . . (366) 

<400> 282 

atg ctg ctg gtt ttg ggt agt ttg eta tta caa gga atg age caa cag 48 
Met Leu Leu Val Leu Gly Ser Leu Leu Leu Gin Gly Met Ser Gin Gin 
15 10 15 

gat cgc agt ttt gcc tct cgc gtg age atg gaa agt cag tea ttg cgc 96 
Asp Arg Ser Phe Ala Ser Arg Val Ser Met Glu Ser Gin Ser Leu Arg 
20 25 30 

cgc cag gcc ate gtt cag teg gcg ctg gcg tgg gga aaa atg cac tgc 14 4 

Arg Gin Ala lie Val Gin Ser Ala Leu Ala Trp Gly Lys Met His Cys 
35 40 45 

tgg cag acg cag cca gca gtt cag tgc teg cag tac get gaa ace gat 192 
Trp Gin Thr Gin Pro Ala Val Gin Cys Ser Gin Tyr Ala Glu Thr Asp 
50 55 60 

gcc cag gtt tgt ttg cgt tta ctg gca gat aat gaa gcc tta ttg att 240 
Ala Gin Val Cys Leu Arg Leu Leu Ala Asp Asn Glu Ala Leu Leu lie 
65 70 75 80 

gcc ggt tat gaa ggc gtt teg ttg tgg cga aca ggc gaa gtc att gat 288 
Ala Gly Tyr Glu Gly Val Ser Leu Trp Arg Thr Gly Glu Val He Asp 
85 90 95 

gga aac att gtt ttt teg cca cgc ggc tgg age gat ttt tgt ccg ctg 336 
Gly Asn He Val Phe Ser Pro Arg Gly Trp Ser Asp Phe Cys Pro Leu 
100 105 110 

aaa gag agg gcg tta tgt cag ctt ccc tga 366 
Lys Glu Arg Ala Leu Cys Gin Leu Pro * 
115 120 



<210> 283 
<211> 564 
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<212> DNA 

<213> Escherichia coli 

<220> 

<221> CDS 

<222> (1) . . . (564) 

<400> 283 

atg cct gta aaa gag caa ggt ttt tct ctg ctg gaa gtg ttg att get 4 8 

Met Pro Val Lys Glu Gin Gly Phe Ser Leu Leu Glu Val Leu lie Ala 
15 10 15 

atg gcg ate agt age gta ttg ttg ctg ggg gcg gca cgc ttt ctg cct 96 
Met Ala lie Ser Ser Val Leu Leu Leu Gly Ala Ala Arg Phe Leu Pro 
20 25 30 

gcg tta cag cgt gaa agt tta acg age acc cgt aag ctg gcg ctg gaa 14 4 

Ala Leu Gin Arg Glu Ser Leu Thr Ser Thr Arg Lys Leu Ala Leu Glu 
35 40 45 

gat gaa ate tgg ctg egg gta ttt acc gtc gcg aag cat etc cag agg 192 
Asp Glu lie Trp Leu Arg Val Phe Thr Val Ala Lys His Leu Gin Arg 
50 55 60 

gcg ggt tat tgt cat ggc ate tgt acc ggc gaa ggg ctg gaa att gtc 24 0 

Ala Gly Tyr Cys His Gly He Cys Thr Gly Glu Gly Leu Glu He Val 
65 70 75 80 

gga cag ggt gac tgt gtc att gtg cag tgg gat gcg aac agt aac ggt 288 
Gly Gin Gly Asp Cys Val He Val Gin Trp Asp Ala Asn Ser Asn Gly 
85 90 95 

ate tgg gat cgc gaa ccg gta aaa gag tec gac cag att gga ttt cgt 336 
He Trp Asp Arg Glu Pro Val Lys Glu Ser Asp Gin He Gly Phe Arg 
100 105 110 

ctg aag gag cat gtg ctg gaa acg eta cgc ggt gcg aca tec tgt gaa 384 
Leu Lys Glu His Val Leu Glu Thr Leu Arg Gly Ala Thr Ser Cys Glu 
115 120 125 

ggt aag ggc tgg gat aaa gtc act aat ccg gat gee ate att ate gac 432 
Gly Lys Gly Trp Asp Lys Val Thr Asn Pro Asp Ala He He lie Asp 
130 135 140 

act ttt cag gtc gta cgt cag gat gtc age ggc ttc teg ccg gtg ttg 480 
Thr Phe Gin Val Val Arg Gin Asp Val Ser Gly Phe Ser Pro Val Leu 
145 150 155 160 

acg gtt aat atg cgt get gee agt aag tct gaa ccg caa acc gtg gtg 528 
Thr Val Asn Met Arg Ala Ala Ser Lys Ser Glu Pro Gin Thr Val Val 
165 170 175 

aat gee age tat age gtg aca gga ttc aac ctg tga 564 
Asn Ala Ser Tyr Ser Val Thr Gly Phe Asn Leu * 
180 185 



<210> 284 
<211> 471 
<212> DNA 
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<213> Escherichia coli 

<220> 

<221> CDS 

<222> (1)...(471) 

<400> 284 

atg aaa aca caa cgt ggt tat acg ctg att gaa acg ctg gtc gcg atg 48 

Met Lys Thr Gin Arg Gly Tyr Thr Leu He Glu Thr Leu Val Ala Met 

15 10 15 

ctg att ttg gtc atg eta age gca agt gga etc tat ggc tgg caa tac 96 
Leu He Leu Val Met Leu Ser Ala Ser Gly Leu Tyr Gly Trp Gin Tyr 
20 25 30 

tgg cag cag teg caa egg eta tgg caa acc gee age cag gcg egg gac 144 
Trp Gin Gin Ser Gin Arg Leu Trp Gin Thr Ala Ser Gin Ala Arg Asp 
35 40 45 

tat ttg etc tat tta cgt gaa gat gee aac tgg cat aac cgc gac cac 192 
Tyr Leu Leu Tyr Leu Arg Glu Asp Ala Asn Trp His Asn Arg Asp His 
50 55 60 

agt ate age gtt ate agg gag ggg acg tta tgg tgc ctt gtg agt tec 240 
Ser He Ser Val He Arg Glu Gly Thr Leu Trp Cys Leu Val Ser Ser 
65 70 75 80 

get get ggg gec aat acc tgt cat ggc agt tea cca ttg gtc ttt gtg 288 
Ala Ala Gly Ala Asn Thr Cys His Gly Ser Ser Pro Leu Val Phe Val 
85 90 95 

cca cgc tgg ccc gaa gtc gaa atg age gac ctg aca cct teg ctt get 336 
Pro Arg Trp Pro Glu Val Glu Met Ser Asp Leu Thr Pro Ser Leu Ala 
100 105 110 

ttc ttt ggc ctg cgc aat acc gca tgg gee ggg cat att cgc ttc aaa 384 
Phe Phe Gly Leu Arg Asn Thr Ala Trp Ala Gly His lie Arg Phe Lys 
115 120 125 

aac tea acg ggc gag tgg tgg ctg gtg gtt teg ccg tgg gga aga etc 4 32 

Asn Ser Thr Gly Glu Trp Trp Leu Val Val Ser Pro Trp Gly Arg Leu 
130 135 140 

egg ctt tgt cag caa gga gaa aca gaa gga tgc ctg taa 4 71 

Arg Leu Cys Gin Gin Gly Glu Thr Glu Gly Cys Leu * 
145 150 155 



<210> 285 
<211> 2136 
<212> DNA 

<213> Escherichia coli 

<220> 

<221> CDS 

<222> (1) . . . (2136) 

<400> 285 

atg age acc att gtg att ttt tta get get ttg ctg gee tgc tea eta 48 
Met Ser Thr He Val He Phe Leu Ala Ala Leu Leu Ala Cys Ser Leu 
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15 10 15 

ctt gcg gga tgg ctg ata aaa gtg cga tec aga egg cgt cag ctg ccc 96 

Leu Ala Gly Trp Leu He Lys Val Arg Ser Arg Arg Arg Gin Leu Pro 
20 25 30 

tgg acc aac gee ttc gcg gat gcg caa acg cgt aaa etc aca cct gaa 14 4 

Trp Thr Asn Ala Phe Ala Asp Ala Gin Thr Arg Lys Leu Thr Pro Glu 
35 40 45 

gaa cgt age gec gtt gaa aat tat ctt gag age ctg acg cag gta tta 192 

Glu Arg Ser Ala Val Glu Asn Tyr Leu Glu Ser Leu Thr Gin Val Leu 

50 55 60 

cag gtg cct ggc cca acg gga gec age gcg gca ccg ate tct ctg gcg 240 

Gin Val Pro Gly Pro Thr Gly Ala Ser Ala Ala Pro He Ser Leu Ala 

65 70 75 80 

ctg aat gec gaa age aac aac gtc atg atg ctg aca cac get ate acg 288 

Leu Asn Ala Glu Ser Asn Asn Val Met Met Leu Thr His Ala He Thr 
85 90 95 

cgt tac ggc ate tct acc gac gat ccg aat aaa tgg cgt tac tac etc 336 

Arg Tyr Gly He Ser Thr Asp Asp Pro Asn Lys Trp Arg Tyr Tyr Leu 
100 105 110 

gat teg gta gaa gtc cac ctg ccc cct ttc tgg gaa cag tac ate aac 384 

Asp Ser Val Glu Val His Leu Pro Pro Phe Trp Glu Gin Tyr He Asn 
115 120 125 

gat gag aat acc gtt gaa ctg att cat acc gat teg ctg ccg ctg gtt 4 32 

Asp Glu Asn Thr Val Glu Leu He His Thr Asp Ser Leu Pro Leu Val 

130 135 140 

att tea etc aac ggt cat acg ctg cag gag tac atg cag gaa act cgc 4 80 

He Ser Leu Asn Gly His Thr Leu Gin Glu Tyr Met Gin Glu Thr Arg 

145 150 155 160 

age tat gee ttg caa cct gtt ccg tea acg cag gcg teg att cgc ggg 528 

Ser Tyr Ala Leu Gin Pro Val Pro Ser Thr Gin Ala Ser lie Arg Gly 
165 170 175 

gaa gag agt gag caa ate gag eta etc aat att cgc aaa gaa acg cat 57 6 

Glu Glu Ser Glu Gin lie Glu Leu Leu Asn lie Arg Lys Glu Thr His 
180 185 190 

gaa gaa tat gcg ctg agt cgt ccg cgc ggg ctg cgt gaa gcg ttg ctg 624 

Glu Glu Tyr Ala Leu Ser Arg Pro Arg Gly Leu Arg Glu Ala Leu Leu 
195 200 205 

ate gtc gee tec ttc ctg atg ttc ttt ttc tgc ctg att acc ccg gat 672 

He Val Ala Ser Phe Leu Met Phe Phe Phe Cys Leu He Thr Pro Asp 

210 215 220 

gta ttt gtt ccg tgg ctg gca ggc ggc gcg tta ctg ctg ctg ggc gca 720 

Val Phe Val Pro Trp Leu Ala Gly Gly Ala Leu Leu Leu Leu Gly Ala 

225 230 235 240 

ggt ctg tgg ggg eta ttc gcg ccc ccg gca aaa tec tec ctg egg gaa 768 

Gly Leu Trp Gly Leu Phe Ala Pro Pro Ala Lys Ser Ser Leu Arg Glu 
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245 250 255 

att cat tgt ctg cgc ggt aca ccc cgt cgt tgg gga ttg ttt ggc gaa 816 
lie His Cys Leu Arg Gly Thr Pro Arg Arg Trp Gly Leu Phe Gly Glu 
260 265 270 

aac gat cag gaa cag ate aac aat att tea etc ggt att ate gac ctg 864 
Asn Asp Gin Glu Gin lie Asn Asn He Ser Leu Gly He lie Asp Leu 
275 280 285 

gtc tat ccc gca cac tgg cag cca tac att get cag gat etc ggt caa 912 
Val Tyr Pro Ala His Trp Gin Pro Tyr He Ala Gin Asp Leu Gly Gin 
290 295 300 

caa ace gat ate gat ate tat etc gac cgc cat gta gtg cgt cag gga 960 
Gin Thr Asp He Asp He Tyr Leu Asp Arg His Val Val Arg Gin Gly 
305 ' 310 315 320 

cga tat ctt teg ctg cat gat gaa gta aaa aac ttt ccg tta cag cac 1008 
Arg Tyr Leu Ser Leu His Asp Glu Val Lys Asn Phe Pro Leu Gin His 
325 330 335 

tgg ctg cgc agt acg att ate get gcg ggc teg ctg ctg gtg ctg ttt 1056 
Trp Leu Arg Ser Thr He He Ala Ala Gly Ser Leu Leu Val Leu Phe 
340 345 350 

atg ctg tta ttc tgg ate ccg ctg gat atg ccg ctg aaa ttc act etc 1104 
Met Leu Leu Phe Trp He Pro Leu Asp Met Pro Leu Lys Phe Thr Leu 
355 360 365 

tea tgg atg aaa ggc gcg cag ace att gaa gee ace age gta aaa caa 1152 
Ser Trp Met Lys Gly Ala Gin Thr He Glu Ala Thr Ser Val Lys Gin 
370 375 380 

ctg get gac get ggc gtg egg gta ggc gat aca ttg cgt att age ggt 1200 
Leu Ala Asp Ala Gly Val Arg Val Gly Asp Thr Leu Arg lie Ser Gly 
385 390 395 400 

acg gga atg tgt aat att cga act tec gga ace tgg age gcg aaa acc 1248 
Thr Gly Met Cys Asn He Arg Thr Ser Gly Thr Trp Ser Ala Lys Thr 
405 410 415 

aat tea cct ttt tta ccg ttt gac tgc teg cag ate ate tgg aac gac 1296 
Asn Ser Pro Phe Leu Pro Phe Asp Cys Ser Gin He He Trp Asn Asp 
420 425 430 

gec cgc tea ttg cca tta cca gaa tct gaa ctg gtc aac aaa gcg acg 1344 
Ala Arg Ser Leu Pro Leu Pro Glu Ser Glu Leu Val Asn Lys Ala Thr 
435 440 445 

gca ttg act gaa gca gtt aat cgc cag ctg cac cct aaa ccg gaa gat 1392 
Ala Leu Thr Glu Ala Val Asn Arg Gin Leu His Pro Lys Pro Glu Asp 
450 455 460 

gaa tct cgc gtc agt gec tea tta cgt tea gca att caa aaa tec ggc 1440 
Glu Ser Arg Val Ser Ala Ser Leu Arg Ser Ala He Gin Lys Ser Gly 
465 470 475 480 

atg gta ttg ctt gat gat ttt ggc gac att gta ctg aag aca gcg gat 1488 
Met Val Leu Leu Asp Asp Phe Gly Asp He Val Leu Lys Thr Ala Asp 
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485 490 495 

tta tgt tct gcc aaa gat gac tgt gtg cga ctg aaa aat gcg ctg gtc 1536 
Leu Cys Ser Ala Lys Asp Asp Cys Val Arg Leu Lys Asn Ala Leu Val 
500 505 510 

aat etc ggc aac agt aaa gac tgg gac gcg ctg gta aaa cgc gcc aac 1584 
Asn Leu Gly Asn Ser Lys Asp Trp Asp Ala Leu Val Lys Arg Ala Asn 
515 520 525 

gcc ggg aag etc gat ggc gtg aat gtg tta tta cgc ccg gtg agt gcg 1632 
Ala Gly Lys Leu Asp Gly Val Asn Val Leu Leu Arg Pro Val Ser Ala 
530 " 535 540 

gaa teg ctg gat aac ctg gtg gca acc tec acc gcg ccg ttt ate acg 1680 
Glu Ser Leu Asp Asn Leu Val Ala Thr Ser Thr Ala Pro Phe lie Thr 
545 550 555 560 

cat gaa acg gcg cga gcg gca caa tea eta aac agt ccg gcc ccc ggc 1728 
His Glu Thr Ala Arg Ala Ala Gin Ser Leu Asn Ser Pro Ala Pro Gly 
565 570 575 

gga ttc ctg att gtc age gac gaa ggc age gat ttt gtt gat cag ccc . 1776 
Gly Phe Leu He Val Ser Asp Glu Gly Ser Asp Phe Val Asp Gin Pro 
580 585 590 

tgg cct teg gca tea ctt tac gac tac ccg ccg caa gaa cag tgg aac 1024 
Trp Pro Ser Ala Ser Leu Tyr Asp Tyr Pro Pro Gin Glu Gin Trp Asn 
595 600 605 



get ttc cag aaa ctg gca caa atg ctg atg cat acg ccg ttt aac gcc 
Ala Phe Gin Lys Leu Ala Gin Met Leu Met His Thr Pro Phe Asn Ala 
610 615 620 



1872 



gag ggt ate gtc aca aaa ate ttc act gac gcc aat ggt acg cag cat 1920 
Glu Gly He Val Thr Lys He Phe Thr Asp Ala Asn Gly Thr Gin His 
625 * 630 635 640 

att ggc ctt cat ccg ate ccg gat cgt tec ggc ctg tgg cgc tat etc 1968 
He Gly Leu His Pro He Pro Asp Arg Ser Gly Leu Trp Arg Tyr Leu 
645 650 655 

age acc aca ttg ctg eta ctg acg atg ctg ggt age gcc att tac aat 2016 
Ser Thr Thr Leu Leu Leu Leu Thr Met Leu Gly Ser Ala He Tyr Asn 
660 665 670 

ggc gta cag gcc tgg cgt cgt tac cag cgt cat cgc act cgc atg atg 2064 
Gly Val Gin Ala Trp Arg Arg Tyr Gin Arg His Arg Thr Arg Met Met 
675 680 685 

gag att cag gcc tat tat gaa age tgc ctg aac ccg caa ctg ate acc 2112 
Glu He Gin Ala Tyr Tyr Glu Ser Cys Leu Asn Pro Gin Leu He Thr 
690 695 700 

cct tea gaa age ctt ate gaa taa 2136 
Pro Ser Glu Ser Leu He Glu * 
705 710 
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<211> 714 
<212> DNA 

<213> Escherichia coli 

<220> 

<221> CDS 

<222> (1) . . . (714) 

<400> 286 

atg eta ccc tgt cgc gca aat tgc ttc act ctg gag att tec etc atg 48 

Met Leu Pro Cys Arg Ala Asn Cys Phe Thr Leu Glu lie Ser Leu Met 
15 10 15 

cat ate aac att gee tgg cag gac gta gat acc gtt ctg ctg gat atg 96 
His lie Asn lie Ala Trp Gin Asp Val Asp Thr Val Leu Leu Asp Met 
20 25 30 

gac ggc acg ttg etc gac etc gee ttc gat aac tat ttc tgg caa aag 144 
Asp Gly Thr Leu Leu Asp Leu Ala Phe Asp Asn Tyr Phe Trp Gin Lys 
35 40 45 

ctg gtg cct gaa aca tgg ggc gcg aaa aac ggg gtt acg cca cag gaa 192 
Leu Val Pro Glu Thr Trp Gly Ala Lys Asn Gly Val Thr Pro Gin Glu 
50 55 ' 60 

gcg atg gaa tat atg cgc cag caa tat cac gac gta cag cat acg eta 240 
Ala Met Glu Tyr Met Arg Gin Gin Tyr His Asp Val Gin His Thr Leu 
65 70 75 80 

aac tgg tac tgt ctt gat tac tgg agt gag caa ctg ggt ctg gat ate 288 
Asn Trp Tyr Cys Leu Asp Tyr Trp Ser Glu Gin Leu Gly Leu Asp lie 
85 90 95 

tgt gcg atg acc acc gag atg gga ccg cgt gee gta ctg cgt gaa gat 336 
Cys Ala Met Thr Thr Glu Met Gly Pro Arg Ala Val Leu Arg Glu Asp 
100 105 " 110 

acc att ccg ttt ctt gag gca ctg aaa gee age ggt aag cag cga att 384 
Thr lie Pro Phe Leu Glu Ala Leu Lys Ala Ser Gly Lys Gin Arg lie 
115 120 125 

ttg etc acc aat gcg cat ccg cac aac ctg gcg gta aaa ctt gag cat 4 32 

Leu Leu Thr Asn Ala His Pro His Asn Leu Ala Val Lys Leu Glu His 
130 135 140 

acc ggt ctg gac gca cac ctt gat tta tta ctt tec acc cac aca ttt 480 
Thr Gly Leu Asp Ala His Leu Asp Leu Leu Leu Ser Thr His Thr Phe 
145 150 155 160 

ggt tat ccg aaa gag gat cag egg tta tgg cat gcg gtg gee gaa get 528 
Gly Tyr Pro Lys Glu Asp Gin Arg Leu Trp His Ala Val Ala Glu Ala 
165 170 175 

acg ggt ctg aaa get gaa aga acg ctg ttt att gat gac age gaa gcg 57 6 

Thr Gly Leu Lys Ala Glu Arg Thr Leu Phe lie Asp Asp Ser Glu Ala 
180 185 190 

att etc gat get gee gcg caa ttt ggt att cgt tac tgc etc ggc gtg 624 
He Leu Asp Ala Ala Ala Gin Phe Gly He Arg Tyr Cys Leu Gly Val 
195 200 205 
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act aat cct gat tec ggg att gec gag aaa cag tat caa cgc cat ccg 672 
Thr Asn Pro Asp Ser Gly lie Ala Glu Lys Gin Tyr Gin Arg His Pro 
210 215 220 

tea ctg aat gac tac cgc cgc ctg ate ccc teg eta atg tga 714 
Ser Leu Asn Asp Tyr Arg Arg Leu lie Pro Ser Leu Met * 
225 230 235 



<210> 287 
<211> 402 
<212> DNA 

<213> Escherichia coli 

<220> 

<221> CDS 

<222> (1) . . . (402) 

<400> 287 

atg aaa gag aaa cct get gtt gag gtt cga ctg gat aaa tgg eta tgg 48 

Met Lys Glu Lys Pro Ala Val Glu Val Arg Leu Asp Lys Trp Leu Trp 

15 10 15 

get gee cgt ttt tat aaa ace cgc gcg ctg gee cgt gaa atg att gaa 96 
Ala Ala Arg Phe Tyr Lys Thr Arg Ala Leu Ala Arg Glu Met lie Glu 
20 25 30 

99^ ggt aag gtg cat tac aac ggg cag cgc age aag ccg age aaa ate 144 
Gly Gly Lys Val His Tyr Asn Gly Gin Arg Ser Lys Pro Ser Lys lie 
35 40 45 

gtc gag ctg aat gee acg etc act ctg cgc cag gga aat gac gaa cgc 192 
Val Glu Leu Asn Ala Thr Leu Thr Leu Arg Gin Gly Asn Asp Glu Arg 
50 55 60 

acg gtg att gta aag gcg att act gaa cag cgt cgc ccc gee age gag 240 
Thr Val He Val Lys Ala He Thr Glu Gin Arg Arg Pro Ala Ser Glu 
65 70 75 80 

gca gec ttg ctg tat gaa gag act gcg gaa agt gta gag aaa cgc gaa 288 
Ala Ala Leu Leu Tyr Glu Glu Thr Ala Glu Ser Val Glu Lys Arg Glu 
85 90 95 

aaa atg gcg ctg gca cgt aaa ctt aat gec tta acc atg ccg cac ccg 336 
Lys Met Ala Leu Ala Arg Lys Leu Asn Ala Leu Thr Met Pro His Pro 
100 105 110 

gac cga cgc ccg gac aaa aaa gag cgc cgc gac ctg tta cga ttt aaa 384 
Asp Arg Arg Pro Asp Lys Lys Glu Arg Arg Asp Leu Leu Arg Phe Lys 
115 120 ' 125 

cac ggc gac agt gaa taa 402 
His Gly Asp Ser Glu * 
130 



<210> 288 
<211> 885 
<212> DNA 
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<213> Escherichia coli 

<220> 

<221> CDS 

<222> (1) . . . (885) 

<400> 288 

atg att atg ccg caa cat gac caa tta cat cgc tat ctg ttt gaa aac 48 

Met lie Met Pro Gin His Asp Gin Leu His Arg Tyr Leu Phe Glu Asn 
15 10 15 



ttt gcc gtg cgc ggc gaa ctg gta acc gtt teg gaa acc ctg caa cag 96 

Phe Ala Val Arg Gly Glu Leu Val Thr Val Ser Glu Thr Leu Gin Gin 
20 25 30 

ate ctt gag aac cac gat tat ccg cag ccc gtt aaa aac gtg ctg gca 14 4 

lie Leu Glu Asn His Asp Tyr Pro Gin Pro Val Lys Asn Val Leu Ala 
35 40 45 

gaa ctg ctg gtt gcg acc age ctg tta acc get acg ctg aag ttt gat 192 

Glu Leu Leu Val Ala Thr Ser Leu Leu Thr Ala Thr Leu Lys Phe Asp 
50 55 60 



ggt gat ate acc gta cag ctg cag ggc gac ggt ccg atg aat ctg gcg 240 

Gly Asp lie Thr Val Gin Leu Gin Gly Asp Gly Pro Met Asn Leu Ala 

65 70 75 80 

gtt att aac ggt aac aat aac cag cag atg cgc ggt gtg gcg cgc gtg 288 

Val He Asn Gly Asn Asn Asn Gin Gin Met Arg Gly Val Ala Arg Val 

85 90 95 • 

cag ggc gaa att cca gaa aat gcc gac ctg aaa acg ctg gtc ggc aat 336 

Gin Gly Glu He Pro Glu Asn Ala Asp Leu Lys Thr Leu Val Gly Asn 

100 105 110 

ggt tac gtg gtg ate acc att acc ccg age gaa ggc gaa cgc tat cag 384 

Gly Tyr Val Val He Thr He Thr Pro Ser Glu Gly Glu Arg Tyr Gin 

115 120 125 

ggc gta gtt ggt ctg gaa ggt gat acc ctg gcg gcc tgc ctg gaa gat 432 

Gly Val Val Gly Leu Glu Gly Asp Thr Leu Ala Ala Cys Leu Glu Asp 

130 ~ 135 140 

tac ttt atg cgt tct gaa cag ctg ccg acg cgc ctg ttt att cgc acc 480 

Tyr Phe Met Arg Ser Glu Gin Leu Pro Thr Arg Leu Phe He Arg Thr 

145 150 155 160 

ggc gac gta gac ggc aaa ccg get gca ggc ggt atg ttg ttg cag gta 528 

Gly Asp Val Asp Gly Lys Pro Ala Ala Gly Gly Met Leu Leu Gin Val 

165 170 175 

atg cct gcg caa aat gcc cag cag gac gac ttt gac cac ctg gcg acg 576 

Met Pro Ala Gin Asn Ala Gin Gin Asp Asp Phe Asp His Leu Ala Thr 

180 185 190 

eta acc gaa acc ate aaa acc gaa gaa ctg ctg acc tta ccg gca aac 624 

Leu Thr Glu Thr He Lys Thr Glu Glu Leu Leu Thr Leu Pro Ala Asn 

195 200 205 

gaa gtg ttg tgg cgt ttg tat cac gaa gaa gag gtg acg gtt tac gat 672 
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Glu Val Leu Trp Arg Leu Tyr His Glu Glu Glu Val Thr Val Tyr Asp 
210 215 220 

ccg cag gat gtg gag ttc aaa tgc acc tgc teg cgt gaa cgt tgc gec 720 
Pro Gin Asp Val Glu Phe Lys Cys Thr Cys Ser Arg Glu Arg Cys Ala 
225 230 235 240 

gat gcg ctg aaa acg ctg cct gat gaa gaa gtt gat age ate ctg gcg 768 
Asp Ala Leu Lys Thr Leu Pro Asp Glu Glu Val Asp Ser lie Leu Ala 
245 250 255 

gaa gat ggc gaa att gac atg cat tgt gat tac tgc ggt aac cac tat 816 
Glu Asp Gly Glu lie Asp Met His Cys Asp Tyr Cys Gly Asn His Tyr 
260 265 270 

ctg ttc aat gcg atg gat att get gaa ate cgc aac aac gcg tct ccg 864 
Leu Phe Asn Ala Met Asp lie Ala Glu lie Arg Asn Asn Ala Ser Pro 
275 280 285 

gca gat ccg caa gtt cat taa 885 
Ala Asp Pro Gin Val His * 
290 



<210> 289 
<211> 363 
<212> DNA 

<213> Escherichia coli 

<220> 

<221> CDS 

<222> (1) . . . (363) 

<400> 289 

atg tta aag eta ttt gca aag tac acc tct att ggt gtg ctg aac acc 48 

Met Leu Lys Leu Phe Ala Lys Tyr Thr Ser He Gly Val Leu Asn Thr 
1 5 10 15 

ctt ata cac tgg gtg gtt ttt ggt gtt tgt ate tat gtc gcg cat aca 96 
Leu He His Trp Val Val Phe Gly Val Cys He Tyr Val Ala His Thr 
20 25 30 

aac caa get ctt gca aac ttc gca ggt ttc gtt gtg get gtg age ttt 144 
Asn Gin Ala Leu Ala Asn Phe Ala Gly Phe Val Val Ala Val Ser Phe 
35 40 45 

age ttc ttc gcg aat gca aaa ttc aca ttc aag gca teg act aca acg 192 
Ser Phe Phe Ala Asn Ala Lys Phe Thr Phe Lys Ala Ser Thr Thr Thr 
50 55 60 

atg cgc tac atg eta tat gtt ggg ttc atg ggg aca ctg agt get act 240 
Met Arg Tyr Met Leu Tyr Val Gly Phe Met Gly Thr Leu Ser Ala Thr 
65 " 70 75 80 

gtt gga tgg get get gat aga tgc gca ctt ccc ccg atg ata act ctt 288 
Val Gly Trp Ala Ala Asp Arg Cys Ala Leu Pro Pro Met He Thr Leu 
85 90 95 

gtc acc ttc tec gec ate age ctg gtg tgc ggt ttc gtc tat tea aag 336 
Val Thr Phe Ser Ala He Ser Leu Val Cys Gly Phe Val Tyr Ser Lys 
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100 



105 



110 



ttc att gtc ttt agg gat gcg aaa tga 
Phe He Val Phe Arg Asp Ala Lys * 
115 120 



363 



<210> 290 
<211> 921 
<212> DNA 

<213> Escherichia coli 



<220> 
<221> CDS 
<222> (1) . 



(921) 



<400> 290 

atg aag ata tct ctt gta gtt cct gtc ttc aat gaa gaa gaa gcg ata 
Met Lys lie Ser Leu Val Val Pro Val Phe Asn Glu Glu Glu Ala He 
15 10 15 



48 



cca att ttt tat aaa acg gta cgt gaa ttc gaa gaa ttg aag tea tat 
Pro He Phe Tyr Lys Thr Val Arg Glu Phe Glu Glu Leu Lys Ser Tyr 
20 25 30 



96 



gaa gtg gaa ate gtt ttc ata aat gac ggc age aaa gac get acg gag 
Glu Val Glu He Val Phe He Asn Asp Gly Ser Lys Asp Ala Thr Glu 
35 40 45 



144 



tea ate att aat get ctg get gtt tea gat cct eta gtt gtt ccg ctg 
Ser He He Asn Ala Leu Ala Val Ser Asp Pro Leu Val Val Pro Leu 
50 55 60 



192 



tea ttt aca cgc aac ttt ggt aaa gaa cca gca ttg ttt gca ggg tta 
Ser Phe Thr Arg Asn Phe Gly Lys Glu Pro Ala Leu Phe Ala Gly Leu 
65 70 75 80 



240 



gac cat gca acc ggg gat gcg ata ate cca att gat gtt gac ctg caa 
Asp His Ala Thr Gly Asp Ala He He Pro He Asp Val Asp Leu Gin 
85 90 95 



288 



gac ccg att gag gtt att cct cat ctt att gaa aaa tgg caa gca ggt 
Asp Pro He Glu Val He Pro His Leu lie Glu Lys Trp Gin Ala Gly 
100 105 110 



336 



get gat atg gtt ctt get aaa aga tct gac cgc tea act gat gga cgc 
Ala Asp Met Val Leu Ala Lys Arg Ser Asp Arg Ser Thr Asp Gly Arg 
115 120 125 



384 



ctg aag cga aaa acg get gag tgg ttc tat aag etc cac aat aaa ata 
Leu Lys Arg Lys Thr Ala Glu Trp Phe Tyr Lys Leu His Asn Lys He 
130 135 140 



432 



age aat cct aaa att gaa gag aat gtt ggt gat ttc agg ctg atg age 
Ser Asn Pro Lys He Glu Glu Asn Val Gly Asp Phe Arg Leu Met Ser 
145 150 155 160 



480 



cgt gat gtt gtc gaa aat att aaa ctt atg cca gaa cga aac ctt ttc 
Arg Asp Val Val Glu Asn He Lys Leu Met Pro Glu Arg Asn Leu Phe 
165 170 175 



528 
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atg aaa ggt att ctg age tgg gta gga gga aag aca gat att gtt gaa 576 
Met Lys Gly He Leu Ser Trp Val Gly Gly Lys Thr Asp He Val Glu 
180 185 190 

tac gtg cga gcg gaa aga att get gga gat aca aaa ttt aat gga tgg 624 
Tyr Val Arg Ala Glu Arg lie Ala Gly Asp Thr Lys Phe Asn Gly Trp 
195 200 205 

aaa ctt tgg aat tta gca ctt gag ggt att aca age ttt tec aca ttc 672 
Lys Leu Trp Asn Leu Ala Leu Glu Gly He Thr Ser Phe Ser Thr Phe 
210 215 220 

cct ctt cgc ate tgg aca tac ata ggg tta gtg gta gec agt gta gca 720 
Pro Leu Arg lie Trp Thr Tyr lie Gly Leu Val Val Ala Ser Val Ala 
225 230 235 240 

ttt att tat ggg gcg tgg atg att tta gat. act ate ata ttt gga aat 768 
Phe He Tyr Gly Ala Trp Met He Leu Asp Thr He He Phe Gly Asn 
245 250 255 

get gtt agg gga tat cct tea eta ctt gtt tea ata ctg ttt tta ggt 816 
Ala Val Arg Gly Tyr Pro Ser Leu Leu Val Ser He Leu Phe Leu Gly 
260 265 270 

gga att cag atg att gga ata gga gta tta ggt gaa tat att gga cgc 8 64 

Gly He Gin Met He Gly He Gly Val Leu Gly Glu Tyr He Gly Arg 
275 280 285 

aca tac att gaa ace aaa aaa cgc ccg aaa tac ate ate aag aga gtc 912 
Thr Tyr He Glu Thr Lys Lys Arg Pro Lys Tyr He He Lys Arg Val 
290 295 300 

aaa aaa tga 921 

Lys Lys * 

305 



<210> 291 
<211> 1332 
<212> DNA 

<213> Escherichia coli 

<220> 

<221> CDS 

<222> (1) . . . (1332) 

<400> 291 

atg aat aaa gca ata aaa gta tea ttg tat ata tct ttt gtt ttg att 48 

Met Asn Lys Ala He Lys Val Ser Leu Tyr He Ser Phe Val Leu He 
1 5 10 15 

att tgc gee tta tct aaa aac ata atg atg tta aat aca tct gat ttc 96 
He Cys Ala Leu Ser Lys Asn He Met Met Leu Asn Thr Ser Asp Phe 
20 . 25 30 

gga aga gee att aag cca tta att gaa gac ata cca gca ttt aca tat 144 
Gly Arg Ala He Lys Pro Leu He Glu Asp He Pro Ala Phe Thr Tyr 
35 40 45 
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gac tta cct tta ttg tat aaa ttg aaa ggt cat att gat tea att gat 192 

Asp Leu Pro Leu Leu Tyr Lys Leu Lys Gly His lie Asp Ser lie Asp 
50 55 60 

age tat gag tat ata agt tea tat agt tat att ttg tat aca tac gtc 240 

Ser Tyr Glu Tyr lie Ser Ser Tyr Ser Tyr lie Leu Tyr Thr Tyr Val 
65 70 75 80 

ctg ttt att age att ttt act gaa tat ctt gat get agg gtg tta teg 288 

Leu Phe lie Ser lie Phe Thr Glu Tyr Leu Asp Ala Arg Val Leu Ser 
85 90 " 95 

tta ttt eta aaa gta ata tat att tat tea tta tat gcg ata ttt act 336 

Leu Phe Leu Lys Val lie Tyr lie Tyr Ser Leu Tyr Ala He Phe Thr 

100 105 110 

tea tat ata aaa aca gaa agg tat gta act tta ttt aca ttc ttt att 384 

Ser Tyr He Lys Thr Glu Arg Tyr Val Thr Leu Phe Thr Phe Phe He 
115 120 125 

tta get ttt ctt atg tgt tct tea tea aca ctg tea atg ttt gca tea 432 

Leu Ala Phe Leu Met Cys Ser Ser Ser Thr Leu Ser Met Phe Ala Ser 
130 135 140 

ttc tat caa gag caa ata gtt ata att ttc ctt cca ttt ttg gtg tat 480 

Phe Tyr Gin Glu Gin He Val He He Phe Leu Pro Phe Leu Val Tyr 
145 150 155 160 

tea tta aca tgc aaa aac aat aaa tct atg ctt ttg eta ttt ttt teg 528 

Ser Leu Thr Cys Lys Asn Asn Lys Ser Met Leu Leu Leu Phe Phe Ser 
165 170 175 

ttg eta ata ata tct act get aaa aat caa ttt ata tta ace cca eta 576 

Leu Leu He He Ser Thr Ala Lys Asn Gin Phe He Leu Thr Pro Leu 

180 185 190 

ata gtg tat tea tat tat att ttt ttt gat aga cac aaa eta att att 624 

He Val Tyr Ser Tyr Tyr He Phe Phe Asp Arg His Lys Leu He He 
195 200 205 

aaa tct gta ata tgc gtg gtg tgc ttg ctt gcg tea ata ttt gca ata 672 

Lys Ser Val lie Cys Val Val Cys Leu Leu Ala Ser He Phe Ala He 
210 215 220 

tct tat tea aaa ggt gtt gtt gaa tta aat aag tac cat gca aca tac 720 

Ser Tyr Ser Lys Gly Val Val Glu Leu Asn Lys Tyr His Ala Thr Tyr 
225 230 235 " 240 

ttc ggt agt tat ctt tat atg aaa aac aac ggg tat aaa atg cca teg 7 68 

Phe Gly Ser Tyr Leu Tyr Met Lys Asn Asn Gly Tyr Lys Met Pro Ser 
245 250 255 

tat gtt gat gat aag tgt gtt ggg tta gat gee tgg ggt aat aaa ttc 816 

Tyr Val Asp Asp Lys Cys Val Gly Leu Asp Ala Trp Gly Asn Lys Phe 

260 265 270 

gac ata tea ttt ggc gca ace cca aca gaa gtt gga acg gaa tgt ttc 864 

Asp He Ser Phe Gly Ala Thr Pro Thr Glu Val Gly Thr Glu Cys Phe 
275 280 285 
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gaa tct cat aaa gat gaa acg ttt teg aat gca etc ttt tta ttg gtt 912 
Glu Ser His Lys Asp Glu Thr Phe Ser Asn Ala Leu Phe Leu Leu Val 
290 295 300 

age aaa cca age ace ate ttc aaa ctt cca ttt gat gat ggt gtg atg 960 
Ser Lys Pro Ser Thr lie Phe Lys Leu Pro Phe Asp Asp Gly Val Met 
305 310 315 320 

tct cag tat aaa gaa aat tat ttc cat gta tat aaa aaa eta cac gta 1008 
Ser Gin Tyr Lys Glu Asn Tyr Phe His Val Tyr Lys Lys Leu His Val 
325 330 335 

ata tat gga gaa tea aac ata eta acg act att act aac ata aaa gac 1056 
He Tyr Gly Glu Ser Asn He Leu Thr Thr He Thr Asn He Lys Asp 
340 345 350 

aat ata ttt aaa aac att aga ttt ata tea ttg tta tta ttt ttt att 1104 
Asn He Phe Lys Asn He Arg Phe He Ser Leu Leu Leu Phe Phe He 
355 360 365 

get tct att ttt att aga aat aat aaa ata aag gca tct tta ttt gta 1152 
Ala Ser He Phe He Arg Asn Asn Lys He Lys Ala Ser Leu Phe Val 
370 375 " 380 

gta tct ctt ttt gga ata tct caa ttt tat gtg tea ttt ttc ggg gaa 1200 
Val Ser Leu Phe Gly He Ser Gin Phe Tyr Val Ser Phe Phe Gly Glu 
385 390 395 400 

gga tat aga gat tta age aag cat tta ttt gga atg tat ttt teg ttc 1248 
Gly Tyr Arg Asp Leu Ser Lys His Leu Phe Gly Met Tyr Phe Ser Phe 
405 410 ' 415 

gac ctt tgc tta tac ata aca gtc gtt ttt tta att tat aaa ata att 1296 
Asp Leu Cys Leu Tyr He Thr Val Val Phe Leu He Tyr Lys He lie 
420 425 430 

caa aga aat caa gac aat age gat gta aag cac taa 1332 
Gin Arg Asn Gin Asp Asn Ser Asp Val Lys His * 
435 440 



<210> 292 
<211> 345 
<212> DNA 

<213> Escherichia coli 

<220> 

<221> CDS 

<222> (1) . . . (345) 

<400> 292 

ttg att etc gat aga cac ata aca tec agg ggt tat gtt tgt atg cgt 4 8 

Met He Leu Asp Arg His He Thr Ser Arg Gly Tyr Val Cys Met Arg 
15 10 15 

ccc gcg act ate gee cca tta acg cca tac gat aaa tgg gat ggt gag 96 
Pro Ala Thr lie Ala Pro Leu Thr Pro Tyr Asp Lys Trp Asp Gly Glu 
20 25 30 

aaa tgg gtg acg gat ace gag gca cag cat age gtc gca gta gat gca 144 
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Lys Trp Val Thr Asp Thr Glu Ala Gin His Ser Val Ala Val Asp Ala 
35 40 45 

gca gaa gca cag cgc cag teg ctg att gat act gca atg get tec att 192 
Ala Glu Ala Gin Arg Gin Ser Leu He Asp Thr Ala Met Ala Ser He 
50 55 60 

agt ctg att caa ctg aaa tta cag get ggg egg aag ctg atg cag gca 240 
Ser Leu He Gin Leu Lys Leu Gin Ala Gly Arg Lys Leu Met Gin Ala 
65 70 75 80 

gag ace tec cga ctt aac act gtg ctg gat tac att gac gcg gtg acg 288 
Glu Thr Ser Arg Leu Asn Thr Val Leu Asp Tyr lie Asp Ala Val Thr 
85 90 95 

gca aca gat acc age acc gcg ccg gat gtc ate tgg cct gaa ctg ccg 336 
Ala Thr Asp Thr Ser Thr Ala Pro Asp Val lie Trp Pro Glu Leu Pro 
100 105 110 

gag gag tag 345 
Glu Glu * 



<210> 293 
<211> 447 
<212> DNA 

<213> Escherichia coli 

<220> 

<221> CDS 

<222> (1) . (447) 

<400> 293 

atg aag cca cga aat att aat aat age eta cca ctg caa cca tta gtt 48 

Met Lys Pro Arg Asn lie Asn Asn Ser Leu Pro Leu Gin Pro Leu Val 

15 10 15 

cct gat cag gag aac aaa aat aag aaa aat gaa gag aaa tec gtt aat 96 
Pro Asp Gin Glu Asn Lys Asn Lys Lys Asn Glu Glu Lys Ser Val Asn 
20 25 30 

cca gtt aaa ate aca atg ggg tct ggt tta aat tat att gaa caa gaa 144 
Pro Val Lys He Thr Met Gly Ser Gly Leu Asn Tyr He Glu Gin Glu 
35 40 45 

tct ctt gga gga aaa tat eta aca cat gat ttg tea ata aag ata gcg 192 
Ser Leu Gly Gly Lys Tyr Leu Thr His Asp Leu Ser lie Lys He Ala 
50 55 60 

gat att tct gaa gag ata att cag caa gca ata tta tct get atg age 240 
Asp He Ser Glu Glu He He Gin Gin Ala He Leu Ser Ala Met Ser 
65 70 75 80 

ata tat aaa ttt teg ata aca gat gat tta atg agt atg get gta aat 288 
He Tyr Lys Phe Ser He Thr Asp Asp Leu Met Ser Met Ala Val Asn 
85 90 95 

gaa etc ata aaa ctg acc aaa ata gag aat aat gta gac ctg aat aaa 336 
Glu Leu He Lys Leu Thr Lys He Glu Asn Asn Val Asp Leu Asn Lys 
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100 

ttc act act ata tgc aca 
Phe Thr Thr lie Cys Thr 
115 

aat aaa gaa aaa aac aaa 
Asn Lys Glu Lys Asn Lys 
130 

ttt aat ttt cat tga 
Phe Asn Phe His * 
145 



105 

gac gtt eta tec ccc cgc 
Asp Val Leu Ser Pro Arg 
120 

cga cat tct acc ctt etc 
Arg His Ser Thr Leu Leu 
135 140 



110 

gtc acc aga cat 384 

Val Thr Arg His 

125 

aaa aat ccc ctt 432 
Lys Asn Pro Leu 



447 



<210> 294 
<211> 504 
<212> DNA 

<213> Escherichia coli 

<220> 

<221> CDS 

<222> (1) . . . (504) 

<400> 294 

atg ccg ggc aac cgc ccg cat tat ggg cgt tgg cct caa cac gat ttt 48 

Met Pro Gly Asn Arg Pro His Tyr Gly Arg Trp Pro Gin His Asp Phe 
15 10 15 

ccg cca ttt aaa aaa etc agg ccg cag teg gta acc teg cgc ata cag 96 
Pro Pro Phe Lys Lys Leu Arg Pro Gin Ser Val Thr Ser Arg lie Gin 
20 " 25 30 

ccg ggc agt gac gtc ate gtc tgc gcg gaa atg gac gaa cag tgg gga 14 4 

Pro Gly Ser Asp Val lie Val Cys Ala Glu Met Asp Glu Gin Trp Gly 
35 40 45 

tac gtc ggg get aaa teg cgc cag cgc tgg ctg ttt tac gcg tat gac 192 
Tyr Val Gly Ala Lys Ser Arg Gin Arg Trp Leu Phe Tyr Ala Tyr Asp 
50 55 60 

agg etc egg aag acg gtt gtt gcg cac gta ttc ggt gaa cgc act atg 24 0 

Arg Leu Arg Lys Thr Val Val Ala His Val Phe Gly Glu Arg Thr Met 
65 70 75 80 

gcg acg ctg ggg cgt ctt atg age ctg ctg tea ccc ttt gac gtg gtg 288 
Ala Thr Leu Gly Arg Leu Met Ser Leu Leu Ser Pro Phe Asp Val Val 
85 90 95 

ata tgg atg acg gat ggc tgg ccg ctg tat gaa tec cgc ctg aag gga 336 
lie Trp Met Thr Asp Gly Trp Pro Leu Tyr Glu Ser Arg Leu Lys Gly 
100 * 105 110 

aag ctg cac gta ate age aag cga tat acg cag cga att gag egg cat 384 
Lys Leu His Val lie Ser Lys Arg Tyr Thr Gin Arg lie Glu Arg His 
115 120 125 

aac ctg aat ctg agg cag cac ctg gca egg ctg gga egg aag teg ctg 432 
Asn Leu Asn Leu Arg Gin His Leu Ala Arg Leu Gly Arg Lys Ser Leu 
130 135 140 
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teg ttc tea aaa teg gtg gag cag cat gac aaa gtc ate ggg cat tat 480 
Ser Phe Ser Lys Ser Val Glu Gin His Asp Lys Val lie Gly His Tyr 
145 " 150 155 160 

ctg aac ata aaa cac tat caa taa 504 
Leu Asn lie Lys His Tyr Gin * 
165 



<210> 295 
<211> 276 
<212> DNA 

<213> Escherichia coli 

<220> 

<221> CDS 

<222> (1) . . . (276) 

<400> 295 

gtg get tct gtt tct ate age tgt ccc tec tgt tea get act gac ggg 4 8 

Met Ala Ser Val Ser lie Ser Cys Pro Ser Cys Ser Ala Thr Asp Gly 
15 10 15 

gtg gtg cgt aac ggc aaa age act gee gga cat cag cgc tat etc tgc 96 
Val Val Arg Asn Gly Lys Ser Thr Ala Gly His Gin Arg Tyr Leu Cys 
20 25 30 

tct cac tgc cgt aaa aca tgg caa ctg cag ttc act tac ace get tct 14 4 

Ser His Cys Arg Lys Thr Trp Gin Leu Gin Phe Thr Tyr Thr Ala Ser 
35 40 45 

caa ccc ggt acg cac cag aaa ate att gat atg gee atg aat ggc gtt 192 
Gin Pro Gly Thr His Gin Lys lie lie Asp Met Ala Met Asn Gly Val 
50 55 60 

gga tgc egg gca ace gec cgc att atg ggc gtt ggc etc aac acg att 24 0 

Gly Cys Arg Ala Thr Ala Arg He Met Gly Val Gly Leu Asn Thr He 
65 70 75 80 

ttc cgc cat tta aaa aac tea ggc cgc agt egg taa 276 
Phe Arg His Leu Lys Asn Ser Gly Arg Ser Arg * 
85 90 



<210> 296 
<211> 4134 
<212> DNA 

<213> Escherichia coli 

<220> 

<221> CDS 

<222> (1)...(4134) 

<400> 296 

atg age gga aaa ccg gca gcg cgt cag ggc gac atg acg cag tat ggc 48 

Met Ser Gly Lys Pro Ala Ala Arg Gin Gly Asp Met Thr Gin Tyr Gly 
15 10 15 

ggt age att gtt cag ggt tea gee ggg gtg cgc att ggt gee ccc acc 96 
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Gly Ser lie Val Gin Gly Ser Ala Gly Val Arg lie Gly Ala Pro Thr 
20 25 30 

ggc gtg gcc tgt teg gtg tgc ccc ggc gga gtg acg tec ggc cat ccg 14 4 

Gly Val Ala Cys Ser Val Cys Pro Gly Gly Val Thr Ser Gly His Pro 
35 40 45 

gtc aat ccc ctg etc ggt gca aag gtc ctt ccc ggt gaa ace gac ate 192 
Val Asn Pro Leu Leu Gly Ala Lys Val Leu Pro Gly Glu Thr Asp lie 
50 55 60 

gcc ctg ccc ggc ccg ctg ccg ttc ate etc tec cgc ace tac age agt 240 
Ala Leu Pro Gly Pro Leu Pro Phe He Leu Ser Arg Thr Tyr Ser Ser 
65 70 75 80 

tac egg aca aaa acg ccc gcg ccg gtg ggg age etc ggc ccc ggc tgg 288 
Tyr Arg Thr Lys Thr Pro Ala Pro Val Gly Ser Leu Gly Pro Gly Trp 
85 90 95 

aaa atg cct gcg gat ate cgc tta cag ctg cgc gat aac aca ctg ata 336 
Lys Met Pro Ala Asp He Arg Leu Gin Leu Arg Asp Asn Thr Leu He 
100 105 110 

etc agt gat aac ggc ggc aga age ctg tat ttt gag cac ctg ttt ccc 384 
Leu Ser Asp Asn Gly Gly Arg Ser Leu Tyr Phe Glu His Leu Phe Pro 
115 " " 120 125 

ggt gag gac ggt tac age cgc age gag tea ctg tgg ctg gtg cgc ggc 432 
Gly Glu Asp Gly Tyr Ser Arg Ser Glu Ser Leu Trp Leu Val Arg Gly 
130 ' 135 140 

ggc gtg gcg aaa ctg gat gaa ggt cac egg ctg gcc gca etc tgg cag 4 80 

Gly Val Ala Lys Leu Asp Glu Gly His Arg Leu Ala Ala Leu Trp Gin 
145 150 155 160 

gcg ctg ccg gaa gaa etc cgc tta agt ccg cat cgt tat ctg gcg aca 528 
Ala Leu Pro Glu Glu Leu Arg Leu Ser Pro His Arg Tyr Leu Ala Thr 
165 170 175 

aac agt ccg cag ggg ccg tgg tgg ctg etc ggt tgg tgt gag egg gtg 576 
Asn Ser Pro Gin Gly Pro Trp Trp Leu Leu Gly Trp Cys Glu Arg Val 
180 185 190 

ccg gaa gcg gat gag gtg ctg cct gcg ccg ctg ccg ccg tac egg gta 624 
Pro Glu Ala Asp Glu Val Leu Pro Ala Pro Leu Pro Pro Tyr Arg Val 
195 200 205 

ctg acc ggg ctg gtg gac cgc ttc ggg cgc aca cag acg ttc cac cgc 672 
Leu Thr Gly Leu Val Asp Arg Phe Gly Arg Thr Gin Thr Phe His Arg 
210 * 215 220 

gaa gcc gcc ggt gaa ttc age ggc gaa ate acc ggc gtg acg gat ggt 720 
Glu Ala Ala Gly Glu Phe Ser Gly Glu He Thr Gly Val Thr Asp Gly 
225 230 235 240 

gcc tgg cgt cac ttc egg ctg gta ctg acc acg cag gcg cag egg gca 768 
Ala Trp Arg His Phe Arg Leu Val Leu Thr Thr Gin Ala Gin Arg Ala 
245 250 25S 

gaa gaa gcc egg cag cag gcc att tec ggc ggg acg gaa ccg tec get 816 
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Glu Glu Ala Arg Gin Gin Ala lie Ser Gly Gly Thr Glu Pro Ser Ala 
260 265 270 

ttt cct gat acc ctg ccg ggt tac acc gaa tat ggc egg gac aac ggc 864 
Phe Pro Asp Thr Leu Pro Gly Tyr Thr Glu Tyr Gly Arg Asp Asn Gly 
275 280 285 

ate cgt ctg tct gec gtg tgg ctg acg cac gac ccg gaa tac ccg gag 912 
lie Arg Leu Ser Ala Val Trp Leu Thr His Asp Pro Glu Tyr Pro Glu 
290 295 300 

aat tta cct gec gcg ccg ctg gtg cgc tat ggc tgg acg cca cgc ggc 960 
Asn Leu Pro Ala Ala Pro Leu Val Arg Tyr Gly Trp Thr Pro Arg Gly 
305 310 315 320 

gaa ctg gcg gtg gtg tat gac cgt agt ggc aaa cag gtg cgc age ttt 1008 
Glu Leu Ala Val Val Tyr Asp Arg Ser Gly Lys Gin Val Arg Ser Phe 
325 330 335 

act tac gat gat aaa tac egg ggc egg atg gtg gcg cac cgt cac acg 1056 
Thr Tyr Asp Asp Lys Tyr Arg Gly Arg Met Val Ala His Arg His Thr 
340 345 350 

ggc egg ccg gaa ate cgt tac cgt tac gac age gac ggg egg gtg aca 1104 
Gly Arg Pro Glu lie Arg Tyr Arg Tyr Asp Ser Asp Gly Arg Val Thr 
355 360 365 

gaa cag eta aac ccg gca ggc tta age tac acg tat cag tat gag aaa 1152 
Glu Gin Leu Asn Pro Ala Gly Leu Ser Tyr Thr Tyr Gin Tyr Glu Lys 
370 375 ~ 380 

gac cgc ate acc ate acc gac age ctg gac cgc cgt gaa gtg ctg cac 1200 
Asp Arg lie Thr lie Thr Asp Ser Leu Asp Arg Arg Glu Val Leu His 
385 390 395 400 

acg cag ggc gaa gee ggg ctg aag egg gtg gtg aaa aag gaa cac gcg 1248 
Thr Gin Gly Glu Ala Gly Leu Lys Arg Val Val Lys Lys Glu His Ala 
405 410 415 

gac ggc age gtc acg cag agt cag ttt gac gec gtg ggc agg etc agg 1296 
Asp Gly Ser Val Thr Gin Ser Gin Phe Asp Ala Val Gly Arg Leu Arg 
420 425 430 

gca cag acg gat gee gca ggc agg aca aca gag tac age ccg gat gtg 1344 
Ala Gin Thr Asp Ala Ala Gly Arg Thr Thr Glu Tyr Ser Pro Asp Val 
435 440 445 

gtg acg ggc etc ate acg cgc ata acc acg ccg gat ggc agg gca teg 1392 
Val Thr Gly Leu lie Thr Arg lie Thr Thr Pro Asp Gly Arg Ala Ser 
450 455 460 

gcg ttt tac tat aac cac cac aac cag tta acg tea gee acc ggg cct 1440 
Ala Phe Tyr Tyr Asn His His Asn Gin Leu Thr Ser Ala Thr Gly Pro 
465 470 475 480 

gac ggg ctg gaa ttg cgc egg gaa tat gat gaa ttg ggc cgt ctg att 1488 
Asp Gly Leu Glu Leu Arg Arg Glu Tyr Asp Glu Leu Gly Arg Leu lie 
485 490 495 

cag gaa act gec cct gac ggc gat ate acc cgc tac cgt tat gat aat 1536 



-280- 



WO 01/34810 



PCT/US00/30950 



Gin Glu Thr Ala Pro Asp Gly Asp lie Thr Arg Tyr Arg Tyr Asp Asn 
500 505 510 

cca cac agt gac tta ccc tgc gca acg gaa gat gcc acc ggc age egg 1584 
Pro His Ser Asp Leu Pro Cys Ala Thr Glu Asp Ala Thr Gly Ser Arg 
515 520 525 

aaa acc atg acg tgg age cgt tac ggt cag ttg ctg age ttc acc gac 1632 
Lys Thr Met Thr Trp Ser Arg Tyr Gly Gin Leu Leu Ser Phe Thr Asp 
530 535 540 

tgt tec ggt tat gta acc cgt tat gac cat gac cgc ttc ggg cag atg 1680 
Cys Ser Gly Tyr Val Thr Arg Tyr Asp His Asp Arg Phe Gly Gin Met 
545 550 555 560 

acg gcg gtg cac cgc gag gaa ggg ctg agt cag tac cgc gca tac gac 1728 
Thr Ala Val His Arg Glu Glu Gly Leu Ser Gin Tyr Arg Ala Tyr Asp 
565 570 575 

age cgt gga cag tta att gcc gtg aaa gac acg cag ggc cat gaa acg 177 6 

Ser Arg Gly Gin Leu He Ala Val Lys Asp Thr Gin Gly His Glu Thr 
580 585 590 

egg tat gaa tac aac ate gcc ggt gac ctg acc gcc gtc att gcc ccg 1824 
Arg Tyr Glu Tyr Asn He Ala Gly Asp Leu Thr Ala Val He Ala Pro 
595 600 605 

gac ggc age aga aac ggg aca cag tac gat gcg tgg gga aag gcc gtc 1872 
Asp Gly Ser Arg Asn Gly Thr Gin Tyr Asp Ala Trp Gly Lys Ala Val 
610 615 620 

cgt acc acg cag ggc ggg eta acg cgc agt atg gaa tac gat get gcc 1920 
Arg Thr Thr Gin Gly Gly Leu Thr Arg Ser Met Glu Tyr Asp Ala Ala 
625 630 635 640 

gga egg gtc ate cgc ctg acc agt gaa aac ggc age cac acc acc ttc 1968 
Gly Arg Val He Arg Leu Thr Ser Glu Asn Gly Ser His Thr Thr Phe 
645 650 655 

cgt tac gat gta ctt gac egg ctg ata cag gaa acc ggc ttt gac ggc 2016 
Arg Tyr Asp Val Leu Asp Arg Leu He Gin Glu Thr Gly Phe Asp Gly 
660 665 670 

cgc aca cag cgt tat cac cac gac ctg acc ggc aaa ctt ate cgc age 2064 
Arg Thr Gin Arg Tyr His His Asp Leu Thr Gly Lys Leu He Arg Ser 
675 680 685 

gag gat gag ggt ctg gtc acc cac tgg cac tat gac gaa gca gac cgc 2112 
Glu Asp Glu Gly Leu Val Thr His Trp His Tyr Asp Glu Ala Asp Arg 
690 695 700 

etc acg cac cgc acc gtg aag ggt gaa acc gca gag egg tgg cag tat 2160 
Leu Thr His Arg Thr Val Lys Gly Glu Thr Ala Glu Arg Trp Gin Tyr 
705 710 715 720 

gac gaa cgt ggc tgg ctg aca gac ate age cat ate age gaa ggg cac 2208 
Asp Glu Arg Gly Trp Leu Thr Asp lie Ser His He Ser Glu Gly His 
725 730 735 

egg gtg gcg gtg cat tac agg tat gat gag aaa ggc egg ctg acc ggt 2256 
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Arg Val Ala Val His Tyr Arg Tyr Asp Glu Lys Gly Arg Leu Thr Gly 
740 745 750 

gag cgt cag acg gtg cat cac ccg cag acg gaa gca ctg etc tgg cag 2304 
Glu Arg Gin Thr Val His His Pro Gin Thr Glu Ala Leu Leu Trp Gin 
755 760 765 

cat gag acc aga cat gcg tac aac gcg cag ggg ctg gcg aac cgc tgt 2352 
His Glu Thr Arg His Ala Tyr Asn Ala Gin Gly Leu Ala Asn Arg Cys 
770 775 780 

ata ccg gac age ctg ccc gec gtg gaa tgg ctg acc tac ggc age ggt 2400 
lie Pro Asp Ser Leu Pro Ala Val Glu Trp Leu Thr Tyr Gly Ser Gly 
785 790 795 800 

tac ctg gca ggc atg aaa etc ggc gac aca ccg ctg gtg gag tac acc 24 48 

Tyr Leu Ala Gly Met Lys Leu Gly Asp Thr Pro Leu Val Glu Tyr Thr 
805 810 815 

cgc gac cgc ctg cac egg gaa acg ctg cgc age ttc ggc cgt tat gaa 24 96 

Arg Asp Arg Leu His Arg Glu Thr Leu Arg Ser Phe Gly Arg Tyr Glu 
820 825 " 830 

etc acc acc get tat acc cct gee ggg cag tta cag age cag cac ctg 2544 
Leu Thr Thr Ala Tyr Thr Pro Ala Gly Gin Leu Gin Ser Gin His Leu 
835 840 845 

aac age ctg ctg tct gac cgc gat tac acc tgg aac gac aac ggc gaa 2592 
Asn Ser Leu Leu Ser Asp Arg Asp Tyr Thr Trp Asn Asp Asn Gly Glu 
850 855 860 

etc ate cgc ate age age ccg cgc cag acc egg agt tac age tac age 264 0 

Leu lie Arg He Ser Ser Pro Arg Gin Thr Arg Ser Tyr Ser Tyr Ser 
865 870 875 880 

acc acc ggc agg ctg acc ggc gtt cac acc acc gca gcg aat ctg gat 2 688 

Thr Thr Gly Arg Leu Thr Gly Val His Thr Thr Ala Ala Asn Leu Asp 
885 890 895 

ate cgc ate ccg tat gee aca gac ccg gca ggt aac cgc ctg ccc gac 2736 
He Arg He Pro Tyr Ala Thr Asp Pro Ala Gly Asn Arg Leu Pro Asp 
900 905 910 

ccg gag ctg cac ccg gac age acc etc age atg tgg ccg gat aac cgt 2784 
Pro Glu Leu His Pro Asp Ser Thr Leu Ser Met Trp Pro Asp Asn Arg 
915 920 925 

ate gee cgt gac gcg cac tat ctt tac egg tat gac cgt cac ggc agg 2832 
He Ala Arg Asp Ala His Tyr Leu Tyr Arg Tyr Asp Arg His Gly Arg 
930 935 ' 940 

ctg aca gag aaa acc gac etc ate ccg gaa ggg gtt ate cgc acg gat 2880 
Leu Thr Glu Lys Thr Asp Leu He Pro Glu Gly Val lie Arg Thr Asp 
945 950 955 960 

gat gag egg act cac egg tac cat tac gac agt cag cac egg ctg gtg 2 928 
Asp Glu Arg Thr His Arg Tyr His Tyr Asp Ser Gin His Arg Leu Val 
965 970 975 

cac tac acg egg aca caa tat gaa gag ccg ctg gtc gaa agt cgc tat 2976 
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His Tyr Thr Arg Thr Gin Tyr Glu Glu Pro Leu Val Glu Ser Arg Tyr 
980 985 990 

ctt tac gac ccg ctg ggc cgc agg gtg gca aaa egg gtg tgg egg cgt 3024 
Leu Tyr Asp Pro Leu Gly Arg Arg Val Ala Lys Arg Val Trp Arg Arg 
995 1000 1005 

gaa egg gac ctg acg ggc tgg atg teg ctg tea egg aaa ccg caa gtg 3072 
Glu Arg Asp Leu Thr Gly Trp Met Ser Leu Ser Arg Lys Pro Gin Val 
1010 1015 1020 

ace tgg tac ggc tgg gac ggc gac egg ctg acc aca ata cag aac gac 3120 
Thr Trp Tyr Gly Trp Asp Gly Asp Arg Leu Thr Thr lie Gin Asn Asp 
1025 1030 1035 1040 

aga acc cgc ate cag acg att tat cag ccg ggg age ttc acg cca etc 3168 
Arg Thr Arg lie Gin Thr lie Tyr Gin Pro Gly Ser Phe Thr Pro Leu 
1045 1050 1055 

ate aga gtt gaa acc gec acc ggt gag ctg gcg aaa acg cag cgc cgc 3216 
lie Arg Val Glu Thr Ala Thr Gly Glu Leu Ala Lys Thr Gin Arg Arg 
1060 1065 1070 

age ctg gcg gat gcg ctt cag cag tec ggc ggc gaa gac ggt ggc agt 3264 
Ser Leu Ala Asp Ala Leu Gin Gin Ser Gly Gly Glu Asp Gly Gly Ser 
1075 1080 * 1085 

gtg gtg ttc ccg ccg gtg ctg gtg cag atg etc gac egg ctg gaa agt 3312 
Val Val Phe Pro Pro Val Leu Val Gin Met Leu Asp Arg Leu Glu Ser 
1090 1095 1100 

gaa ate ctg get gac egg gtg agt gag gaa age cgc cgc tgg ctg gca 3360 
Glu lie Leu Ala Asp Arg Val Ser Glu Glu Ser Arg Arg Trp Leu Ala 
1105 1110 1115 1120 

teg tgc ggc ctg acc gtg gag cag atg caa aac cag atg gac ccg gtg 34 08 
Ser Cys Gly Leu Thr Val Glu Gin Met Gin Asn Gin Met Asp Pro Val 
1125 1130 1135 

tac acg ccg gcg cga aaa ate cac ctg tac cac tgc gac cat cgc ggc 3456 
Tyr Thr Pro Ala Arg Lys lie His Leu Tyr His Cys Asp His Arg Gly 
1140 1145 1150 

ctg ccg ctg gec ctt ate age aag gaa ggg aca aca gaa tgg tgc gca 3504 
Leu Pro Leu Ala Leu lie Ser Lys Glu Gly Thr Thr Glu Trp Cys Ala 
1155 1160 1165 

gaa tac gat gaa tgg ggc aac ctg ctg aat gaa gag aac ccg cat cag 3552 
Glu Tyr Asp Glu Trp Gly Asn Leu Leu Asn Glu Glu Asn Pro His Gin 
1170 1175 1180 

ctg cag cag ctt ate cgc ctg ccg ggg cag cag tat gat gag gag tec 3600 
Leu Gin Gin Leu lie Arg Leu Pro Gly Gin Gin Tyr Asp Glu Glu Ser 
1185 1190 1195 1200 

ggc ctg tat tac aac cgc cac cgc tat tat gac . ccg ctg cag ggg egg 3648 
Gly Leu Tyr Tyr Asn Arg His Arg Tyr Tyr Asp Pro Leu Gin Gly Arg 
1205 1210 1215 

tat ate act cag gat ccg att ggg ctg aag ggg gga tgg aat ttt tat 3696 
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Tyr lie Thr Gin Asp Pro lie Gly Leu Lys Gly Gly Tip Asn Phe Tyr 
1220 1225 1230 



cag tat ccg ttg aat cca gtt acg aat aca gat cct ctg ggg tta gaa 
Gin Tyr Pro Leu Asn Pro Val Thr Asn Thr Asp Pro Leu Gly Leu Glu 
1235 1240 1245 



3744 



gtt ttt cct aga cca ttc ccc ttg cca att cca tgg ccc aaa age cct 
Val Phe Pro Arg Pro Phe Pro Leu Pro lie Pro Trp Pro Lys Ser Pro 
1250 1255 1260 



3792 



gca cag cag caa gca gat gat aat get gca aaa gca ttg aca aaa tgg 
Ala Gin Gin Gin Ala Asp Asp Asn Ala Ala Lys Ala Leu Thr Lys Trp 
1265 1270 1275 1280 



3840 



tgg aac gat aca gca tea caa aga ata ttt gac tct eta ata ttg aat 
Trp Asn Asp Thr Ala Ser Gin Arg lie Phe Asp Ser Leu lie Leu Asn 
1285 1290 1295 



3888 



aat ccg gga eta gca tta gat ata aca atg ata get tct cgt gga aat 
Asn Pro Gly Leu Ala Leu Asp lie Thr Met lie Ala Ser Arg Gly Asn 
1300 1305 - 1310 



3936 



gtt gca gac aca ggg ata act gat cgt gtc aat gac ata ata aat gac 
Val Ala Asp Thr Gly lie Thr Asp Arg Val Asn Asp lie lie Asn Asp 
1315 1320 1325 



3984 



aga ttc tgg agt gat ggg aaa aaa ccc gac aga tgt gac gta ctt cag 
Arg Phe Trp Ser Asp Gly Lys Lys Pro Asp Arg Cys Asp Val Leu Gin 
1330 1335 1340 



4032 



gaa eta att gat tgt ggt gat att agt get aaa gat gca aaa age aca 
Glu Leu lie Asp Cys Gly Asp lie Ser Ala Lys Asp Ala Lys Ser Thr 
1345 1350 1355 1360 



4080 



cag aaa gee tgg aat tgt cgt cac tec aga cag tea aac gat aaa aaa 
Gin Lys Ala Trp Asn Cys Arg His Ser Arg Gin Ser Asn Asp Lys Lys 
1365 1370 1375 



4128 



aga tag 
Arg * 



4134 



<210> 297 
<211> 843 
<212> DNA 

<213> Escherichia coli 

<220> 

<221> CDS 

<222> (1) . . . (843) 

<400> 297 

atg tea aat aca tac cag aaa aga aag gca agt aaa gag tat ggt tta 48 
Met Ser Asn Thr Tyr Gin Lys Arg Lys Ala Ser Lys Glu Tyr Gly Leu 
15 10 15 

tat aat caa tgt aag aaa eta aat gat gat gaa tta ttt cgc tta ctt 96 
Tyr Asn Gin Cys Lys Lys Leu Asn Asp Asp Glu Leu Phe Arg Leu Leu 
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20 25 30 

gat gat cac aat tec ttg aaa agg att tea tct gee aga gta tta cag 14 4 

Asp Asp His Asn Ser Leu Lys Arg lie Ser Ser Ala Arg Val Leu Gin 
35 40 45 

tta aga ggt ggg caa gac get gtt aga ttg gca att gag ttc tgc tct 192 
Leu Arg Gly Gly Gin Asp Ala Val Arg Leu Ala He Glu Phe Cys Ser 
50 55 60 

gat aaa aat tat ate cgt aga gat ate gga gca ttt ata etc ggg caa 240 
Asp Lys Asn Tyr He Arg Arg Asp He Gly Ala Phe He Leu Gly Gin 
65 70 75 80 

ata aaa att tgc aaa aaa tgc gaa gat aat gtt ttt aat att ttg aac 288 
lie Lys He Cys Lys Lys Cys Glu Asp Asn Val Phe Asn He Leu Asn 
85 90 95 

aat atg gca ttg aat gat aag age get tgc gtt cga get acg gca ate 336 
Asn Met Ala Leu Asn Asp Lys Ser Ala Cys Val Arg Ala Thr Ala He 
100 105 110 

gag tea acg get cag cga tgc aag aaa aac cca att tat tea cct aaa 384 
Glu Ser Thr Ala Gin Arg Cys Lys Lys Asn Pro He Tyr Ser Pro Lys 
115 120 125 

ata gta gaa caa tct caa att act get ttt gat aaa teg act aat gtc 432 
He Val Glu Gin Ser Gin He Thr Ala Phe Asp Lys Ser Thr Asn Val 
130 135 140 

aga cgt get aca gca ttt get att tct gtt ate aat gat aaa gca aca 480 
Arg Arg Ala Thr Ala Phe Ala He Ser Val He Asn Asp Lys Ala Thr 
145 ~ 150 155 160 

att cca eta ttg att aat ctg tta aaa gat cca aat gga gat gtc aga 528 
He Pro Leu Leu He Asn Leu Leu Lys Asp Pro Asn Gly Asp Val Arg 
165 170 175 

aac tgg gec gca ttt gca ata aat ate aat aaa tat gat aat agt gat 576 
Asn Trp Ala Ala Phe Ala lie Asn lie Asn Lys Tyr Asp Asn Ser Asp 
180 185 190 

att agg gat tgt ttt gtg gag atg ctt cag gat aaa aat gag gaa gtc 624 
lie Arg Asp Cys Phe Val Glu Met Leu Gin Asp Lys Asn Glu Glu Val 
195 200 205 

cgt att gaa gca ata ate gga ctt tec tac aga aaa gat aaa agg gtt 672 
Arg He Glu Ala He lie Gly Leu Ser Tyr Arg Lys Asp Lys Arg Val 
210 215 220 

tta tct gtt tta tgc gat gag tta aaa aaa aat act gtt tat gat gat 720 
Leu Ser Val Leu Cys Asp Glu Leu Lys Lys Asn Thr Val Tyr Asp Asp 
225 230 235 240 

ate att gaa get gcg ggt gaa tta ggt gat aaa acg eta ctt cct gtt 7 68 

He He Glu Ala Ala Gly Glu Leu Gly Asp Lys Thr Leu Leu Pro Val 
245 250 255 

tta gat act atg ttg tac aag ttt gat gac aat gaa att ata act tec 816 
Leu Asp Thr Met Leu Tyr Lys Phe Asp Asp Asn Glu lie lie Thr Ser 
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260 265 270 

get att gat aag ctg aag cgt tea tag 843 

Ala lie Asp Lys Leu Lys Arg Ser * 

275 ~ 280 



<210> 298 
<211> 702 
<212> DNA 

<213> Escherichia coli 

<220> 

<221> CDS 

<222> (1)...{702) 

<400> 298 

gtg cag atg etc gac egg ctg gaa agt gaa ate ctg get gac egg gtg 48 

Met Gin Met Leu Asp Arg Leu Glu Ser Glu lie Leu Ala Asp Arg Val 
15 10 15 

agt gag gaa age cgc cgc tgg ctg gca teg tgc ggc ctg acc gtg gag 96 
Ser Glu Glu Ser Arg Arg Trp Leu Ala Ser Cys Gly Leu Thr Val Glu 
20 25 30 

cag atg aaa aac cag atg gac ccg gtg tac acg ccg gcg cga aaa ate 14 4 

Gin Met Lys Asn Gin Met Asp Pro Val Tyr Thr Pro Ala Arg Lys lie 
35 40 4 5 

cac ctg tac cac tgc gac cat cgc ggc ctg ccg ctg gec ctt ate age 192 
His Leu Tyr His Cys Asp His Arg Gly Leu Pro Leu Ala Leu He Ser 
50 55 60 

aag gaa ggg aca aca gaa tgg tgc gca gaa tac gat gaa tgg ggc aac 240 
Lys Glu Gly Thr Thr Glu Trp Cys Ala Glu Tyr Asp Glu Trp Gly Asn 
65 70 75 80 

ctg ctg aat gaa gag aac ccg cat cag ctg cag cag ctt ate cgc ctg 288 
Leu Leu Asn Glu Glu Asn Pro His Gin Leu Gin Gin Leu He Arg Leu 
85 90 95 

ccg ggg cag cag tat gat gag gag tec ggc ctg tat tac aac cgc cac 336 
Pro Gly Gin Gin Tyr Asp Glu Glu Ser Gly Leu Tyr Tyr Asn Arg His 
100 105 110 

cgc tat tat gac ccg ctg cag ggg egg tat ate act cag gat ccg att 384 
Arg Tyr Tyr Asp Pro Leu Gin Gly Arg Tyr He Thr Gin Asp Pro He 
115 120 125 

ggg ctg aag ggg gga tgg aat ttt tat cag tat ccg ttg aat ccg gtc 432 
Gly Leu Lys Gly Gly Trp Asn Phe Tyr Gin Tyr Pro Leu Asn Pro Val 
130 135 140 

ata aat gta gat ccg caa ggt ttg gtt gat ata aat tta tac ccc gaa 480 
He Asn Val Asp Pro Gin Gly Leu Val Asp He Asn Leu Tyr Pro Glu 
145 150 155 160 

agt gat ctt ate cat tct gta get gat gag att aat ate cca ggc gtt 528 
Ser Asp Leu He His Ser Val Ala Asp Glu He Asn He Pro Gly Val 
165 170 175 
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ttc aca ate ggg ggg cat ggt acc ccc aca tct att gaa tec gca acg 57 6 

Phe Thr lie Gly Gly His Gly Thr Pro Thr Ser lie Glu Ser Ala Thr 
180 185 190 

cgc agt ate atg aca get aaa gat eta gca tat eta att aaa ttt gat 624 
Arg Ser lie Met Thr Ala Lys Asp Leu Ala Tyr Leu lie Lys Phe Asp 
195 200 205 

ggg aat tat aaa gat ggg atg aca gtt tgg tta ttt tct tgt aat aca 672 
Gly Asn Tyr Lys Asp Gly Met Thr Val Trp Leu Phe Ser Cys Asn Thr 
210 215 220 

ggt aaa gga caa aat tea ttt get age taa 702 
Gly Lys Gly Gin Asn Ser Phe Ala Ser * 
225 230 



<210> 299 
<211> 320 
<212> PRT 

<213> Escherichia coli 



<400> 299 



Met Met 


He 


Lys 


Thr 


Arg 


Phe 


Ser 


Arg 


Trp Leu Thr 


Phe Phe Thr Phe 


1 




r 

D 










10 


15 


Ala Ala 


Ala 


Val 


Ala 


Leu 


Ala 


Leu 


Pro 


Ala Lys Ala 


Asn Thr Trp Pro 






20 














30 


Leu Pro 


Pro 


Ala 


Gly 


Ser 


Arg 


Leu 


Val 


Gly Glu Asn Lys Phe His Val 




35 










40 






45 


Val Glu 


Asn 


Asp 


Gly 


Gly 


Ser 


Leu 


Glu 


Ala He Ala 


Lys Lys Tyr Asn 


50 










55 






60 




Val Gly 


Phe 


Leu 


Ala 


Leu 


Leu 


Gin 


Ala 


Asn Pro Gly Val Asp Pro Tyr 


65 








70 








75 


80 


Val Pro 


Arg 


Ala 


Gly 


Ser 


Val 


Leu 


Thr 


He Pro Leu 


Gin Thr Leu Leu 








85 










90 


95 


Pro Asp 


Ala 


Pro 


Arg 


Glu 


Gly 


He 


Val 


He Asn He 


Ala Glu Leu Arg 






100 










105 




110 


Leu Tyr 


Tyr 


Tyr 


Pro 


Pro 


Gly 


Lys 


Asn 


Ser Val Thr 


Val Tyr Pro He 




115 










120 






125 


Gly He 


Gly 


Gin 


Leu 


Gly 


Gly 


Asp 


Thr 


Leu Thr Pro 


Thr Met Val Thr 


130 










135 






140 




Thr Val 


Ser 


Asp 


Lys 


Arg 


Ala 


Asn 


Pro 


Thr Trp Thr 


Pro Thr Ala Asn 


145 








150 








155 


160 


He Arg 


Ala 


Arg 


Tyr 


Lys 


Ala 


Gin 


Gly 


He Glu Leu 


Pro Ala Val Val 








165 










170 


175 


Pro Ala 


Gly 


Leu 


Asp 


Asn 


Pro 


Met 


Gly 


His His Ala 


He Arg Leu Ala 






180 










185 




190 


Ala Tyr 


Gly 


Gly 


Val 


Tyr 


Leu 


Leu 


His 


Gly Thr Asn Ala Asp Phe Gly 




195 










200 






205 


He Gly 


Met 


Arg 


Val 


Ser 


Ser 


Gly 


Cys 


He Arg Leu 


Arg Asp Asp Asp 


210 










215 






220 




He Lys 


Thr 


Leu 


Phe 


Ser 


Gin 


Val 


Thr 


Pro Gly Thr Lys Val Asn He 


225 








230 








235 


240 


He Asn 


Thr 


Pro 


He 


Lys 


Val 


Ser 


Ala 


Glu Pro Asn 


Gly Ala Arg Leu 








245 










250 


255 


Val Glu 


Val 


His 


Gin 


Pro 


Leu 


Ser 


Glu 


Lys He Asp Asp Asp Pro Gin 






260 










265 




270 


Leu Leu 


Pro 


He 


Thr 


Leu 


Asn 


Ser 


Ala 


Met Gin Ser 


Phe Lys Asp Ala 




275 










280 






285 
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Ala Gin Thr Asp Ala Glu 
290 

Gly Met Pro Val Asp Val 
305 310 

<210> 300 
<211> 728 
<212> PRT 

<213> Escherichia coli 



<400> 300 



Met 


He 


Thr 


Arg 


He 


Pro 


1 








5 




Thr 


Ala 


Gin 


Thr 


Asn 


Glu 








20 






Glu 


Leu 


Glu 


Asp 


Lys 


Phe 






35 








Lys 


Ser 


Phe 


Ser 


Glu 


Asn 




50 










Asn 


Lys 


Asp 


Phe 


Val 


He 


65 










70 


Lys 


Lys 


Glu 


He 


Thr 


He 










85 




Gin 


Thr 


He 


Ala 


Pro 


He 








100 






Gin 


Arg 


Ser 


Asp 


Thr 


Leu 






115 








Arg 


Lys 


Glu 


Lys 


Glu 


Phe 




130 










Phe 


Pro 


Gly 


Arg 


Pro 


Gin 


145 










150 


Gin 


Tyr 


He 


Asn 


Asp 


He 










165 




Asp Asp 


Tyr 


Ser 


Ser 


Ala 








180 






Cys 


Lys 


Asn 


Gin 


Thr 


Leu 






195 








Cys 


Val 


Ala 


Ala 


Cys 


Gly 




210 










Asp 


Lys 


Met 


Glu 


Ser 


Asn 


225 










230 


Val 


Asp 


Asn 


His 


Ala 


Leu 










245 




Glu 


Gly 


Cys 


Thr 


His 


Tyr 








260 






Asn Asp 


Lys 


He 


Arg 


He 






275 








Tyr 


Ser 


Leu 


Met 


Asp 


Phe 




290 










Ser 


Asn 


Asp 


His 


Val 


He 


305 










310 


Pro 


Lys 


Glu 


Gin 


Leu 


Leu 










325 




Pro 


Pro 


Leu 


Ser 


Pro 


Ala 








340 






Asn 


His 


Gin 


Leu 


Thr 


Gin 






355 








Leu 


His 


Arg 


Thr 


Glu 


Met 




370 










Gly Leu 


Tyr 


Leu 


Ala 


He 



Val 


Met 


Gin 


His 


Val 


Met 


295 










300 


Arg 


Arg 


His 


Gin 


Val 


Ser 










315 




Arg 


Ser 


Ser 


Phe 


Ser 


Ala 








10 






His 


Gin 


Thr 


Leu 


Ser 


Glu 






25 








Ser 


Gly 


Lys 


Glu 


Leu 


Ala 




40 










Cys 


Arg 


Gin 


Asn 


Gly 


Arg 


55 










60 


Lys 


Phe 


Ser 


Thr 


Ser 


Val 










75 




He 


Asn 


Lys 


Asn 


Glu 


Asn 








90 






Phe 


Glu 


Lys 


Tyr 


Leu 


Met 






105 








Asp 


Lys 


Gin 


Glu 


Leu 


Asn 




120 










Pro 


Arg 


He 


Lys 


Leu 


Asn 


135 










140 


Asn 


Arg 


He 


Val 


Cys 


Arg 










155 




Tyr 


Gin 


Asn 


Val 


Asp 


Tyr 








170 






Glu 


Lys 


Phe 


Leu 


Thr 


His 






185 








Ala 


Leu 


Val 


Ser 


Ser 


Arg 




200 










Asp 


Phe 


Gly 


Leu 


Val 


Met 


215 










220 


Gly 


He 


Ser 


Val 


Met 


Ala 










235 




Thr 


Val 


Arg 


Leu 


Arg 


He 








250 






Val 


Val 


Ser 


Val 


Tyr 


Asp 






265 








Met 


Ser 


Glu 


Ser 


Lys 


Glu 




280 










Met 


Asn 


Val 


Asp 


Tyr 


Ser 


295 










300 


Asn 


Gin 


Ser 


Val 


Ala 


lie 










315 




Met 


Leu 


Lys 


Gly 


Ser 


Val 








330 






Thr 


Met 


Asn 


Leu 


Leu 


Met 






345 








Leu 


Met 


lie 


Gin 


Leu 


Gin 




360 










Leu 


Thr 


Ala 


Tyr 


Asn 


Ser 


375 










380 


Asn 


Tyr 


Gly 


Asn 


Ala 


Asp 
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Asp 


val 


Arg 


Ser 


Pro 


Gin 


Thr 


Leu 








320 


Asn 


lie 


Asn 


Asn 






15 




Leu 


Phe 


Tyr 


Lys 




30 






Thr 


Pro 


Leu 


Leu 


45 








His 


lie 


Phe 


Ser 


Leu 


Gin 


Ala 


Asp 








80 


Thr 


Thr 


Leu 


Thr 






95 




Glu 


He 


Leu 


Pro 




110 






Leu 


Lys 


Ser 


Asp 


125 








Gly 


Gin 


Cys 


Tyr 


His 


lie 


Ala 


Ala 








160 


Lys 


Pro 


His 


Gin 






175 




Phe 


Asn 


Lys 


Lys 




190 






Pro 


Glu 


Gly 


Arg 


205 








Lys 


Ala 


Tyr 


Phe 


Ala 


lie 


Leu 


Leu 








240 


Lys 


Asn 


Thr 


Thr 






255 




Pro 


Asn 


Val 


Thr 




270 






Asn 


lie 


Lys 


His 


285 








Leu 


Leu 


Lys 


Trp 


lie 


Pro 


Ala 


Leu 








320 


Asp 


Glu 


lie 


Thr 






335 




Ala 


lie 


Gly 


Gin 




350 






Lys 


Met 


Pro 


Glu 


365 








lie 


Asn 


Leu 


Pro 


lie 


Val 


Glu 


Thr 
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385 390 



lie Phe 


Asn 


Ser 


Leu 
405 


Ser 


Glu 


Thr 


Lys Asn 


Leu 


Met 
420 


His 


He 


Leu 


Glu 


Gly Leu 


Phe 


Leu Ala 


He 


Ser 


Arg 




435 










A A Cl 

4 4 U 


lie Leu 


Asn 


Ala 


Leu 


Pro Lys 


Leu 


450 










455 




Glu Gin 


Val 


Tyr Lys 


Phe 


Leu 


5er 


465 








470 






Val Leu 


Tyr 


His 


Val 
485 


Met 


Ala 


Asn 


Val Leu 


Asn 


Ala 
500 


Leu 


Pro 


Leu 


Leu 


Glu Gin 


Val 


Leu Asp 


Leu 


Leu 


Lys 




515 










520 


Gly Leu 


Tyr 


Leu 


Ala 


Met 


Gin 


Asn 


530 










535 




lie Leu 


Glu 


Ala 


Leu 


Pro 


Ser 


Leu 


545 








550 






Ser Asp 


He 


Val Asp 


Leu 


Leu 


Tnr 






565 








Gly Leu 


Phe 


Met 
580 


Ala 


Met 


Gin 


Arg 


He Phe 


Asn 
595 


Ala 


Leu 


Pro 


Thr 


Leu 

600 


Lys Asn 


Met 


Lys 


Pro 


Leu 


Leu 


Leu 


610 










615 




Gly Leu 


Phe 


Ser 


Ala 


He 


Gin 


HIS 


625 








630 






Val Tyr 


Leu 


Ala 


Leu 


Ser Asp 


HlS 








645 








Glu Asp 


He 


Met 
660 


Asp 


Phe 


Trp 


Gin 


Ala Phe 


Glu 
675 


Leu 


Ala 


Phe 


Glu 


Phe 
680 


He Leu 


Asn 


Thr 


Leu 


Asn 


Lys 


Met 


690 










695 




Asn Pro 


Arg 


Tyr 


He 


Ala 


Glu 


Lys 


705 








710 






Lys Ala 


Ser 


Pro 


His 
725 


Thr 


Val 


Arg 



<210> 301 
<211> 84 
<212> PRT 

<213> Escherichia coli 
<400> 301 

Met Thr Asp Lys lie Arg Thr Leu 

1 5 
Met Glu Lys Ser He Val Val Ala 
20 

He Tyr Gly Lys Phe He Lys Arg 

35 40 
Glu Asn Asn Glu Cys Gly He Gly 

50 55 
Arg Pro Leu Ser Lys Thr Lys Ser 
65 70 



395 400 



Gly Tyr Glu Gly 


Leu 


Leu 


Ser 


Lys 


410 






415 




Ala Lys Asp Lys Asn 


biy 


rne 


C A V 

ber 


425 




A OA 

4 30 






Lys Asp Lys Asn 


Val 


vai 


inr 


ber 




445 








Ala Ala Thr His 


His 


Leu 


ASp 


Asn 


4 60 










Ala Lys Asn Arg Thr 


Ser 


ser 


HIS 


475 








4B0 


Gly Asp Ala Asp 


Met 


Leu 


Lys 


He 


4 90 






4 95 




He Arg Thr' Cys 


His 


Leu 


Thr 


Lys 


505 




510 






Ala Lys Asp Phe 


Tyr 


Gly 


Cys 


Pro 




525 








Gly His Ser Asp 


He 


Val 


Lys 


Val 


540 










Ala Gin Glu He 


Asn 


He 


Ser 


Ala 


555 








560 


Ala Lys Ser Leu 


Ala 


Arg 


Asp 


Thr 


570 






575 




Gly His Met Asn Val 


He 


Asn 


Tnr 


585 




con 

590 






Phe Asn Thr Phe Lys 


ir ne 


Asp 


Lys 




605 








Ala Asn Asn Ser 


Asn 


biU 


Tyr 


Pro 


620 










Lys Gin Gin Asn 


Val 


vai 


IjIU 


inr 


635 








04U 


Ala Arg Leu Phe Gly 


t*ne 


i nr 


Hid 


650 






655 




His Lys Ala Pro 


Gin 


Lys 


Tyr 


Ser 


665 




670 






Gly His Arg Val 


He 


Ala 


Glu 


Leu 




685 








Ala Glu Ser Phe 


Gly 


Phe 


Thr 


Asp 


700 










Asn Tyr Met Glu 


Ala 


Leu 


Leu 


Lys 


715 








720 


Gin Gly Arg Val 


Val 


ber 


Asp 


Lys 


10 










He Glu Arg Phe 


Val 


Lys 


His 


Pro 


25 




30 






Thr Thr Lys Leu 


His 


Val 


His 


Asp 




45 








Asp Val Val Glu 


He 


Arg 


Glu 


Cys 


60 










Trp Thr Leu Val 


Arg 


Val 


Val 


Glu 


75 








80 
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Lys Ala Val Leu 



<210> 302 
<211> 63 
<212> PRT 

<213> Escherichia coli 



<400> 302 

Met Lys Ala Lys Glu Leu Arg Glu 

1 5 
Glu Leu Leu Asn Leu Leu Arg Glu 
20 

Ala Ser Gly Gin Leu Gin Gin Ser 

35 40 
Asp Val Ala Arg Val Lys Thr Leu 
50 55 



Lys Ser Val Glu Glu Leu Asn Thr 

10 15 
Gin Phe Asn Leu Arg Met Gin Ala 
25 30 
His Leu Leu Lys Gin Val Arg Arg 
45 

Leu Asn Glu Lys Ala Gly Ala 
60 



<210> 303 
<211> 136 
<212> PRT 

<213> Escherichia coli 



<400> 303 




















Met 


Leu 


Gin 


Pro Lys Arg Thr Lys 


Phe Arg Lys 


Met 


His 


Lys Gly Arg 


1 






5 






10 










15 


Asn 


Arg 


Gly Leu Ala 


Gin Gly Thr Asp Val 


Ser 


Phe Gly Ser 


Phe Gly 








20 




25 










30 




Leu 


Lys 


Ala 


Val Gly Arg Gly Arg 


Leu 


Thr 


Ala 


Arg 


Gin 


He 


Glu Ala 






35 




40 










45 






Ala 


Arg 


Arg Ala Met 


Thr Arg Ala 


Val 


Lys 


Arg 


Gin 


Gly Lys 


He Trp 




50 






55 








60 








He 


Arg 


Val 


Phe Pro 


Asp Lys Pro 


He 


Thr 


Glu 


Lys 


Pro 


Leu 


Ala Val 


65 








70 






75 








80 


Arg 


Met 


Gly 


Lys Gly 


Lys Gly Asn Val 


Glu 


Tyr 


Trp Val 


Ala 


Leu He 








85 






90 










95 


Gin 


Pro 


Gly 


Lys Val 


Leu Tyr Glu 


Met 


Asp Gly 


Val 


Pro 


Glu 


Glu Leu 








100 




105 










110 




Ala 


Arg 


Glu 


Ala Phe 


Lys Leu Ala 


Ala 


Ala 


Lys 


Leu 


Pro 


He Lys Thr 






115 




120 










125 






Thr 


Phe 
130 


Val 


Thr Lys 


Thr Val Met 
135 

















<210> 304 
<211> 233 
<212> PRT 

<213> Escherichia coli 



<400> 304 










Met Gly 


Gin 


Lys 


Val 


His 


Pro Asn 


1 






5 






Pro >Trp 


Asn 


Ser 


Thr 


Trp 


Phe Ala 






20 








Leu Asp 


Ser 


Asp 


Phe 


Lys 


Val Arg 




35 








40 


Lys Ala 


Ser 


Val 


Ser 


Arg 


He Val 


50 










55 


Arg Val 


Thr 


He 


His 


Thr Ala Arg 


65 








70 




Gly Glu 


Asp 


Val 


Glu 


Lys 


Leu Arg 



Gly He Arg Leu Gly He Val Lys 

10 15 
Asn Thr Lys Glu Phe Ala Asp Asn 
25 30 
Gin Tyr Leu Thr Lys Glu Leu Ala 
45 

lie Glu Arg Pro Ala Lys Ser He 
60 

Pro Gly He Val He Gly Lys Lys 

75 80 
Lys Val Val Ala Asp He Ala Gly 
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85 



If- 1 

vai 


Pro 


Ala Gin lie 


Asn 


He 


Ala Glu 






luu 






105 


Ala 


Lys 


Leu vai Ala 


Asp 


Ser 


He Thr 






TIC 

115 






120 


Met 


Phe 


Arg Arg Ala 


Met 


Lys 


Arg Ala 




130 






135 




Gly 


Ala 


Lys Gly lie 


Lys 


Vai 


Glu Vai 


145 






150 






Glu 


lie 


Ala Arg Thr 


G1U 


Trp Tyr Arg 






165 








Thr 


Leu 


Arg Ala Asp 


He 


Asp 


Tyr Asn 






180 






185 


Tyr 


Gly 


Vai lie Gly 


Vai 


Lys 


Vai Trp 






195 






200 


Gly 


Gly 


Met Ala Ala 


Vai 


Glu 


Gin Pro 




210 






215 




Lys 


Lys 


Gin Gin Arg 


Lys 


Gly Arg Lys 


225 






230 







<210> 305 
<211> 110 
<212> PRT 

<213> Escherichia coli 
<400> 305 



Met Glu 


Thr 


He 


Ala 


Lys 


His Arg His 


1 






5 






Vai Arg 


Leu 


Vai 


Ala 


Asp 


Leu He Arg 






20 






25 


Leu Asp 


He 


Leu 


Thr 


Tyr 


Thr Asn Lys 




35 








40 


Lys Vai 


Leu 


Glu 


Ser 


Ala 


He Ala Asn 


50 










55 


Asp He 


Asp 


Asp 


Leu 


Lys 


Vai Thr Lys 


65 








70 




Ser Met 


Lys 


Arg 


He 


Met 


Pro Arg Ala 








85 






Leu Lys 


Arg 


Thr 


Ser 


His 


He Thr Vai 






100 






105 



<210> 306 
<211> 92 
<212> PRT 

<213> Escherichia coli 
<400> 306 



Met Pro Arg 


Ser 


Leu 


Lys 


Lys Gly Pro 


1 




5 






Lys Lys Vai 


Glu 


Lys 


Ala 


Vai Glu Ser 




20 






25 


Thr Trp Ser 


Arg 


Arg 


Ser 


Thr He Phe 


35 








40 


He Ala Vai 


His 


Asn 


Gly 


Arg Gin His 


50 








55 


Glu Met Vai 


Gly 


His 


Lys 


Leu Gly Glu 


65 






70 




Arg Gly His 


Ala 


Ala 


Asp 


Lys Lys Ala 






85 
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yyj 














vai 


Arg 


Lys 


rro 




Leu 


T\ — _ 

ASp 










1 1U 






ber 


Gin 


Leu 


CjIU 


Arg 


Arg 


Vai 








IZD 








vai 


bin 


Asn 


A "1 -» 

Ala 


net 


Arg 


Leu 






14 U 










Ser 


Gly 


Arg 


Leu 


Gly 


Gly 


Ala 




ice 

155 










160 


Glu 


Gly 


Arg 


Vai 


Pro 


Leu 


His 


170 










175 




Thr 


Ser 


Glu 


Ala 


His 


Thr 


Thr 










190 






He 


Phe 


Lys 


Gly 


Glu 


He 


Leu 








205 








Glu 


Lys 


Pro 


Ala 


Ala 


Gin 


Pro 






220 










Ala 


Arg 


Ser 


ser 


Ala 


Gin 


Lys 


1U 










1 c 
lj 




Lily 


T • we* 

Lys 


Lys 


vai 


oer 


bin 


A 1 

Ala 
















Lys 


Ala 


Ala 


vai 


T ^ , i 

Leu 


vai 


T ■ re* 

Lys 








A C 

H 3 








Ala 


Glu 


His 


Asn 


Asp 


Gly 


Ala 






60 










He 


Phe 


Vai 


Asp 


Glu 


Gly 


Pro 




75 










80 


Lys 


Gly 


Arg 


Ala 


Asp 


Arg 


He 


90 










95 




Vai 


Vai 


Ser 


Asp 


Arg 














110 






pne 


T 1 _ 

lie 


TV — _ 

ASp 


Leu 


HIS 


Leu 


Leu 


1 u 










J. D 




pi 

\j±y 


Asp 


Lys 


Lys 


Pro 


Leu 


Arg 










30 






Pro 


Asn 


Met 


He 


Gly 


Leu 


Thr 








45 








Vai 


Pro 


Vai 


Phe 


Vai 


Thr 


Asp 






60 










Phe 


Ala 


Pro 


Thr 


Arg 


Thr 


Tyr 




75 










80 


Lys 
an 


Lys 


Lys 











90 
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<210> 307 
<211> 273 
<212> PRT 

<213> Escherichia coli 



<400> 307 



Met 


Ala 


Val 


Val 


Lys 


Cys 


Lys Pro Thr 


Ser 


Pro 


Gly 


Arg 


Arg 


His 


Val 


1 








5 






10 










15 




Val 


Lys 


Val 


Val 


Asn 


Pro 


Glu Leu His 


Lys 


Gly 


Lys 


Pro 


Phe 


Ala 


Pro 








20 






25 










30 






Leu 


Leu 


Glu 


Lys 


Asn 


Ser 


Lys Ser Gly 


Gly 


Arg 


Asn 


Asn 


Asn 


Gly 


Arg 






35 








40 








45 








He 


Thr 


Thr 


Arg 


His 


He 


Gly Gly Gly 


His 


Lys 


Gin 


Ala 


Tyr 


Arg 


He 




50 










55 






60 










Val 


Asp 


Phe 


Lys 


Arg 


Asn 


Lys Asp Gly 


He 


Pro 


Ala 


Val 


Val 


Glu 


Arg 


65 










70 






75 










80 


Leu 


Glu 


Tyr 


Asp 


Pro 


Asn 


Arg Ser Ala 


Asn 


He 


Ala 


Leu 


Val 


Leu 


Tyr 










85 






90 










95 




Lys 


Asp 


Gly 


Glu 


Arg 


Arg 


Tyr He Leu 


Ala 


Pro 


Lys 


Gly 


Leu 


Lys 


Ala 








100 






105 










110 






Gly 


Asp 


Gin 


He 


Gin 


Ser 


Gly Val Asp 


Ala 


Ala 


He 


Lys 


Pro 


Gly 


Asn 






115 








120 








125 








Thr 


Leu 


Pro 


Met 


Arg 


Asn 


He Pro Val 


Gly 


Ser 


Thr 


Val 


His 


Asn 


Val 




130 










135 






140 










Glu 


Met 


Lys 


Pro 


Gly 


Lys 


Gly Gly Gin 


Leu 


Ala 


Arg 


Ser 


Ala 


Gly 


Thr 


145 










150 






155 










160 


Tyr 


Val 


Gin 


He 


Val 


Ala 


Arg Asp Gly 


Ala 


Tyr 


Val 


Thr 


Leu 


Arg 


Leu 










165 






170 










175 




Arg 


Ser 


Gly 


Glu 


Met 


Arg 


Lys Val Glu 


Ala 


Asp 


Cys 


Arg 


Ala 


Thr 


Leu 








180 






185 










190 






Gly 


Glu 


Val 


Gly 


Asn 


Ala 


Glu His Met 


Leu 


Arg 


Val 


Leu 


Gly 


Lys 


Ala 






195 








200 








205 








Gly 


Ala 


Ala 


Arg 


Trp 


Arg 


Gly Val Arg 


Pro 


Thr 


Val 


Arg 


Gly 


Thr 


Ala 




210 










215 






220 










Met 


Asn 


Pro 


Val 


Asp 


His 


Pro His Gly 


Gly 


Gly 


Glu 


Gly 


Arg 


Asn 


Phe 


225 










230 






235 










240 


Gly 


Lys 


His 


Pro 


Val 


Thr 


Pro Trp Gly 


Val 


Gin 


Thr 


Lys 


Gly 


Lys 


Lys 










245 






250 










255 




Thr 


Arg 


Ser 


Asn 


Lys 


Arg 


Thr Asp Lys 


Phe 


He 


Val 


Arg 


Arg 


Arg 


Ser 








260 






265 










270 







Lys 



<210> 308 
<211> 100 
<212> PRT 

<213> Escherichia coli 



<400> 308 



Met 


He 


Arg 


Glu 


Glu 


Arg 


Leu Leu Lys 


Val 


Leu 


Arg 


Ala 


Pro 


His 


Val 


1 








5 






10 










15 




Ser 


Glu 


Lys 


Ala 


Ser 


Thr 


Ala Met Glu 


Lys 


Ser 


Asn 


Thr 


He 


Val 


Leu 








20 






25 










30 






Lys 


Val 


Ala 


Lys 


Asp 


Ala 


Thr Lys Ala 


Glu 


He 


Lys 


Ala 


Ala 


Val 


Gin 






35 








40 








45 








Lys 


Leu 


Phe 


Glu 


Val 


Glu 


Val Glu Val 


Val 


Asn 


Thr 


Leu 


Val 


Val 


Lys 




50 










55 






60 










Gly 


Lys 


Val 


Lys 


Arg 


His 


Gly Gin Arg 


He 


Gly 


Arg 


Arg 


Ser 


Asp 


Trp 


65 










70 






75 










80 


Lys 


Lys 


Ala 


Tyr 


Val 


Thr 


Leu Lys Glu 


Gly 


Gin 


Asn 


Leu 


Asp 


Phe 


Val 
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85 90 95 

Gly Gly Ala Glu 
100 

<210> 309 
<211> 201 
<212> PRT 

<213> Escherichia coli 
<400> 309 



Met 


Glu 


Leu 


val 


Leu 


Lys 


Asp Ala Gin Ser Ala Leu Thr 


\7=»1 Oar 

vai oer 


1*1 u 


1 








c 
D 






10 


X o 




Thr 


Thr 


Pile 


c?ly 


Arg 


ASp 


Phe Asn Glu 


Ala Leu Val His 


Pin \7a 1 


vai 














25 








vai 


Ala 


Tyr 


Ala 


Ala 


biy 


Ala Arg Gin Gly Thr Arg Ala 


Gin Lys 


i nr 






Jo 








40 


45 






Arg 


Ala 


CjIU 


\T- 1 

vai 


rpl_ v~ 

inr 


pi 

biy 


Ser Gly Lys 


Lys Pro Trp Arg 


Gin Lys 


fii \/ 
uiy 




en 
DU 










55 


60 






rp I_ v_ 


Cj±y 


Arg 


AJ.a 


Arg 


Ser 


Gly Ser He 


Lys Ser Pro He 






65 










70 




75 




80 


Gly 


Gly 


Val 


Thr 


Phe 


Ala 


Ala Arg Pro 


Gin Asp His Ser 


Gin Lys 


Val 










85 






90 


95 




Asn 


Lys 


Lys 


Met 


Tyr 


Arg 


Gly Ala Leu 


Lys Ser He Leu 


Ser Glu 


Leu 








100 






105 




110 




Val 


Arg 


Gin 


Asp 


Arg 


Leu 


He Val . Val 


Glu Lys. Phe Ser 


Val Glu 


Ala 






115 








120 


125 






Pro 


Lys 


Thr 


Lys 


Leu 


Leu 


Ala Gin Lys 


Leu Lys Asp Met 


Ala Leu 


Glu 




130 










135 


140 






Asp 


Val 


Leu 


He 


He 


Thr 


Gly Glu Leu Asp Glu Asn Leu 


Phe Leu 


Ala 


145 










150 




155 




160 


Ala 


Arg 


Asn 


Leu 


His 


Lys 


Val Asp Val Arg Asp Ala Thr 


Gly lie 


Asp 










165 






170 


175 




Pro 


Val 


Ser 


Leu 


He 


Ala 


Phe Asp Lys 


Val Val Met Thr 


Ala Asp 


Ala 








180 






185 




190 




Val 


Lys 


Gin 


Val 


Glu 


Glu 


Met Leu Ala 












195 








200 









<210> 310 
<211> 209 
<212> PRT 

<213> Escherichia coli 
<400> 310 



Met 


He 


Gly 


Leu 


Val 


Gly 


Lys Lys Val Gly Met Thr Arg 


lie Phe Thr 


1 








5 




10 


15 


Glu 


Asp 


Gly 


Val 


Ser 


He 


Pro Val Thr Val He Glu Val 


Glu Ala Asn 




20 






25 


30 


Arg 


Val 


Thr 


Gin 


Val 


Lys 


Asp Leu Ala Asn Asp Gly Tyr 


Arg Ala He 




35 








40 45 




Gin 


Val 


Thr 


Thr 


Gly 


Ala 


Lys Lys Ala Asn Arg Val Thr 


Lys Pro Glu 




50 










55 60 




Ala 


Gly 


His 


Phe 


Ala 


Lys 


Ala Gly Val Glu Ala Gly Arg 


Gly Leu. Trp 


65 










70 


75 


80 


Glu 


Phe 


Arg 


Leu 


Ala 


Glu 


Gly Glu Glu Phe Thr Val Gly 


Gin Ser lie 








85 




90 


95 


Ser 


Val 


Glu 


Leu 


Phe 


Ala 


Asp Val Lys Lys Val Asp Val 


Thr Gly Thr 








100 






105 


110 


Ser 


Lys 


Gly 


Lys 


Gly 


Phe 


Ala Gly Thr Val Lys Arg Trp 


Asn Phe Arg 






115 








120 125 




Thr 


Gin 


Asp 


Ala 


Thr 


His 


Gly Asn Ser Leu Ser His Arg 


Val Pro Gly 
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130 






135 


Ser 


He Gly 


Gin 


Asn Gin 


Thr Pro 


145 






150 




Met 


Ala Gly 


Gin 


Met Gly 


Asn Glu 








165 




Val 


Val Arg 


Val 


Asp Ala 


Glu Arg 






180 






Val 


Pro Gly 


Ala 


Thr Gly 


Ser Asp 




195 






200 


Ala 











140 



Gly Lys 


Val 


Phe 


Lys Gly Lys 


Lys 




155 










160 


Arg Val 


Thr 


Val 


Gin 


Ser 


Leu 


Asp 


170 










175 




Asn Leu 


Leu 


Leu 


Val 


Lys 


Gly 


Ala 


185 








190 






Leu He 


Val 


Lys 


Pro 
205 


Ala 


Val 


Lys 



<210> 311 
<211> 103 
<212> PRT 

<213> Escherichia coli 
<400> 311 



Met Gin 


Asn 


Gin 


Arg 


He 


Arg 


He 


Arg Leu Lys Ala 


Phe Asp His 


Arg 


1 






5 








10 


15 


Leu He 


Asp 


Gin 
20 


Ala 


Thr 


Ala 


Glu 


He Val Glu Thr 
25 


Ala Lys Arg 
30 


Thr 


Gly Ala 


Gin 
35 


Val 


Arg 


Gly 


Pro 


lie 
40 


Pro Leu Pro Thr 


Arg Lys Glu 
45 


Arg 


Phe Thr 


Val 


Leu 


lie 


Ser 


Pro 


His 


Val Asn Lys Asp 


Ala Arg Asp 


Gin 


50 










55 




60 






Tyr Glu 


He 


Arg 


Thr 


His 


Leu Arg 


Leu Val Asp lie 


Val Glu Pro 


Thr 


65 








70 






75 




80 


Glu Lys 


Thr 


Val 


Asp 
85 


Ala 


Leu 


Met 


Arg Leu Asp Leu 
90 


Ala Ala Gly 
95 


Val 


Asp Val 


Gin 


He 
100 


Ser 


Leu 


Gly 











<210> 312 
<211> 1569 
<212> PRT 

<213> Escherichia coli 
<400> 312 



Met 


Asn 


Thr 


lie 


His 


Leu 


Arg Cys 


Leu Phe Arg Met 


Asn Pro 


Leu 


Val 


1 








5 






10 




15 




Trp 


Cys 


Leu 


Trp 


Ala 


Asp 


Val Ala 


Ala Lys Leu Arg 


Ser Leu 


Lys 


Arg 








20 








25 


30 






Tyr 


Ser 


Val 


Phe 


Thr 


Phe 


Gin Arg 


Met Lys Phe Met 


Asn Arg 


Thr 


Ser 






35 








40 




45 






Pro 


Tyr 


Tyr 


Cys 


Arg 


Arg 


Ser Val 


Leu Ser Leu Leu 


lie Ser 


Ala 


Leu 




50 










55 


60 








lie 


Tyr 


Ala 


Pro 


Pro 


Gly 


Met Ala 


Ala Phe Thr Thr 


Asn Val 


He 


Gly 


65 










70 




75 






80 


Val 


Val 


Asn 


Asp 


Glu 


Thr 


Val Asp 


Gly Asn Gin Lys 


Val Asp Glu 


Arg 










85 






90 




95 




Gly 


Thr 


Thr 


Asn 


Asn 


Thr 


His lie 


He Asn His Gly 


Gin Gin 


Asn 


Val 








100 








105 


110 






His 


Gly 


Gly 


Val 


Ser 


Asn 


Gly Ser 


Leu lie Glu Ser 


Gly Gly Tyr 


Gin 






115 








120 




125 






Asp 


He 


Gly 


Ser 


His 


Asn 


Asn Phe 


Val Gly Gin Ala 


Asn Asn 


Thr 


Thr 




130 










135 


140 








He 


Asn 


Gly 


Gly 


Arg 


Gin 


Ser He 


His Asp Gly Gly 


lie Ser 


Thr 


Gly 


145 










150 




155 






160 


Thr 


Thr 


lie 


Glu 


Ser 


Gly 


Asn Gin 


Asp Val Tyr Lys 


Gly Gly 


lie 


Ser 
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1 CR 

103 










1 / u 










1 7C 
1/3 




Asn 


p i • > 
Gly 


Thr 


Thr 


Tl „ 

lie 


Lys 


p l *# 

Giy 


Giy 


Aia 


ber 


Arg 


vai 


p i »i 

GlU 


P 1 ■ F 

Gly 


P 1 . F 

Gly 


Ser 








ion 

lou 










IOC 

la j 










i qh 

l^U 






Ala 


Asn 


p i ■ » 
Gly 


Tift 

lie 


T nil 

Leu 


Tl rt 

lie 


Asp 


Giy 


r*i »f 

Giy 


Car" 

ber 


Gin 


Tift 

lie 


vai 


T a Fft 

Lys 


vai 


Gin 






iyo 










onn 
zuu 
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650 










655 




Val 


Leu 


Asn 


Ser 


Ala 


Gly 


Thr 


Gin 


Thr 


He 


Tyr 


Ala 


Ser 


Ala 


Thr 


Ser 








660 










665 










670 






Asp 


Lys 


Ala 


Asn 


He 


Lys 


Gly 


Gly 


Lys 


Gin 


Thr 


Val 


Tyr 


Gly 


Leu* Ala 






675 










680 










685 








Thr 


Glu 


Ala 


Asn 


He 


Glu 


Ser 


Gly 


Glu 


Gin 


He 


Val 


Asp 


Gly 


Gly 


Ser 




690 










695 










700 










Thr 


Glu 


Lys 


Thr 


His 


He 


Asn 


Gly 


Gly 


Thr 


Gin 


Thr 


Val 


Gin 


Asn 


Tyr 


705 










710 










715 










720 


Gly 


Lys 


Ala 


He 


Asn 


Thr 


Asp 


He 


Val 


Ser 


Gly 


Leu 


Gin 


Gin 


He 


Met 










725 










730 










735 




Ala 


Asn 


Gly 


Thr 


Ala 


Glu 


Gly 


Ser 


He 


He 


Asn 


Gly 


Gly 


Ser 


Gin 


Val 








740 










745 










750 






Val 


Asn 


Glu 


Gly 


Gly 


Leu 


Ala 


Glu 


Asn 


Ser 


Val 


Leu 


Asn 


Asp 


Gly 


Gly 






755 










760 










765 








Thr 


Leu 


Asp 


Val 


Arg 


Glu 


Lys 


Gly 


Ser 


Ala 


Thr 


Gly 


He 


Gin 


Gin 


Ser 




770 










775 










780 










Ser 


Gin 


Gly 


Ala 


Leu 


Val 


Ala 


Thr 


Thr 


Arg 


Ala 


Thr 


Arg 


Val 


Thr 


Gly 


785 










790 
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1045 








1050 








1055 


Asp 


Ser 


Ala 


Thr 


Leu Thr Leu Gly Asn 


Gly Val Asp Thr Thr 


Leu 


Ala 






1060 








1065 








1070 




Asp 


Leu 


Thr 


Ala 


Ala 


Ser Arg Gly 


Ser 


Val 


Trp Leu Asn 


Ser 


Asn 


Asn 






1075 








1080 








1085 






Ser 


Cys 


Ala 


Gly 


Thr 


Ser 


Asn 


Cys 


Glu 


Tyr 


Arg 


Val 


Asn 


Ser 


Leu 


Leu 
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Asp Gly Asp 


Val 


Tyr 


Leu 


Ser 


Ala 


Gin 


Thr 


Ala 
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Ala 
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1125 1130 1135 

Ser Gly Asn Phe Tyr Leu His Thr Asn Val Ala Gly Ser Arg Gly Asp 

1140 1145 1150 

Gin Leu Val Val Asn Asn Asn Ala Thr Gly Asn Phe Lys He Phe Val 

1155 1160 1165 

Gin Asp Thr Gly Val Ser Pro Gin Ser Asp Asp Ala Met Thr Leu Val 

1170 1175 HBO 

Lys Thr Gly Gly Gly Asp Ala Ser Phe Thr Leu Gly Asn Thr Gly Gly 
1185 1190 1195 1200 

Phe Val Asp Leu Gly Thr Tyr Glu Tyr Val Leu Lys Ser Asp Gly Asn 

1205 1210 1215 

Ser Asn Trp Asn Leu Thr Asn Asp Val Lys Pro Asn Pro Asp Pro He 

1220 1225 1230 

Pro Asn Pro Lys Pro Asp Pro Lys Pro Asp Pro Lys Pro Asp Pro Asn 

1235 1240 1245 

Pro Lys Pro Asp Pro Thr Pro Asp Pro Thr Pro Thr Pro Val Pro Glu 

1250 1255 1260 

Lys Arg He Thr Pro Ser Thr Ala Ala Val Leu Asn Met Ala Ala Thr 
1265 1270 1275 1280 

Leu Pro Leu Val Phe Asp Ala Glu Leu Asn Ser He Arg Glu Arg Leu 

1285 1290 1295 

Asn He Met Lys Ala Ser Pro His Asn Asn Asn Val Trp Gly Ala Thr 

1300 1305 1310 

Tyr Asn Thr Arg Asn Asn Val Thr Thr Asp Ala Gly Ala Gly Phe Glu 

1315 * 1320 1325 

Gin Thr Leu Thr Gly Met Thr Val Gly lie Asp Ser Arg Asn Asp He 

1330 1335 1340 

Pro Glu Gly He Thr Thr Leu Gly Ala Phe Met Gly Tyr Ser His Ser 
1345 1350 1355- 1360 

His He Gly Phe Asp Arg Gly Gly His Gly Ser Val Gly Ser Tyr Ser 

1365 1370 1375 

Leu Gly Gly Tyr Ala Ser Trp Glu His Glu Ser Gly Phe Tyr Leu Asp 

1380 1385 1390 

Gly Val Val Lys Leu Asn Arg Phe Lys Ser Asn Val Ala Gly Lys Met 

1395 1400 1405 

Ser Ser Gly Gly Ala Ala Asn Gly Ser Tyr His Ser Asn Gly Leu Gly 

1410 1415 1420 

Gly His He Glu Thr Gly Met Arg Phe Thr Asp Gly Asn Trp Asn Leu 
1425 1430 1435 1440 

Thr Pro Tyr Ala Ser Leu Thr Gly Phe Thr Ala Asp Asn Pro Glu Tyr 

1445 1450 1455 

His Leu Ser Asn Gly Met Lys Ser Lys Ser Val Asp Thr Arg Ser lie 

1460 1465 1470 

Tyr Arg Glu Leu Gly Ala Thr Leu Ser Tyr Asn Met Arg Leu Gly Asn 

1475 1480 1485 

Gly Met Glu Val Glu Pro Trp Leu Lys Ala Ala Val Arg Lys Glu Phe 

1490 1495 1500 

Val Asp Asp Asn Arg Val Lys Val Asn Ser Asp Gly Asn Phe Val. Asn 
1505 * 1510 1515 1520 

Tyr Leu Ser Gly Arg Arg Gly lie Tyr Gin Ala Gly He Lys Ala Ser 

1525 1530 1535 

Phe Ser Ser Thr Leu Ser Gly His Leu Gly Val Gly Tyr Ser His Ser 

1540 1545 1550 

Ala Gly Val Glu Ser Pro Trp Asn Ala Val Ala Gly Val Asn Trp Ser 
1555 1560 1565 

Phe 



<210> 313 
<211> 38 
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<212> PRT 

<213> Escherichia coli 
<400> 313 

Met Lys Val Arg Ala Ser Val Lys Lys Leu Cys Arg Asn Cys Lys lie 

1 5 10 15 

Val Lys Arg Asp Gly Val lie Arg Val lie Cys Ser Ala Glu Pro Lys 

20 25 30 

His Lys Gin Arg Gin Gly 
35 

<210> 314 
<211> 443 
<212> PRT 

<213> Escherichia coli 



<400> 314 
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He 


Pro Gly He Asp Ala 


Ala 
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Lys 
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Gin 


Gin 


Arg 


Gly Thr He He Glu 


Met 


Phe 




50 










55 






60 






Asn 


Met 


Phe 


Ser 


Gly 
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Ala 


Leu 


Ser 


Arg Ala Ser He Phe 


Ala 


Leu 


65 










70 








75 




80 


Gly 


He 


Met 


Pro 


Tyr 


He 


Ser 


Ala 


Ser 


He He He Gin Leu 


Leu 


Thr 








85 










90 


95 




Val 


Val 


His 


Pro 


Thr 


Leu 


Ala 


Glu 


He 


Lys Lys Glu Gly Glu 


Ser 


Gly 








100 










105 


110 






Arg 


Arg 


Lys 


He 


Ser 


Gin 


Tyr 


Thr 


Arg 


Tyr Gly Thr Leu Val 


Leu 


Ala 






115 










120 




125 






He 


Phe 


Gin 


Ser 


He 


Gly 


He 


Ala 


Thr 


Gly Leu Pro Asn Met 


Pro 


Gly 




130 










135 






140 






Met 


Gin 


Gly 


Leu 


Val 


He 


Asn 


Pro 


Gly 


Phe Ala Phe Tyr Phe 


Thr 


Ala 


145 










150 
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160 


Val 


Val 


Ser 


Leu 


Val 


Thr 


Gly 


Thr 


Met 


Phe Leu Met Trp Leu 


Gly 


Glu 
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170 


175 




Gin 


He 


Thr 


Glu 


Arg 


Gly 


He 


Gly 


Asn 


Gly He Ser He He 


He 


Phe 








180 










185 


190 






Ala 


Gly 


He 


Val 


Ala 


Gly 


Leu 


Pro 


Pro 


Ala He Ala His Thr 


lie 


Glu 
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200 
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Gin 


Ala 


Arg 


Gin 


Gly 


Asp 


Leu 


His 


Phe 


Leu Val Leu Leu Leu 


Val 


Ala 
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215 






220 






Val 


Leu 


Val 


Phe 


Ala 


Val 


Thr 


Phe 


Phe 


Val Val Phe Val Glu 


Arg 


Gly 


225 










230 








235 




240 


Gin 


Arg 


Arg 


He 


Val 


Val 


Asn 


Tyr 


Ala 


Lys Arg Gin Gin Gly 


Arg 


Arg 










245 










250 


255 




Val 


Tyr 


.Ala 


Ala 


Gin 


Ser 


Thr 


His 


Leu 


Pro Leu Lys Val Asn 


Met 


Ala 






260 










265 


270 






Gly 


Val 


He 


Pro 


Ala 


He 


Phe 


Ala 


Ser 


Ser lie lie Leu Phe 


Pro 


Ala 




275 










280 




285 






Thr 


He 


Ala 


Ser 


Trp 


Phe 


Gly 


Gly 


Gly 


Thr Gly Trp Asn Trp 


Leu 


Thr 




290 










295 






300 






Thr 


He 


Ser 


Leu 


Tyr 


Leu 


Gin 


Pro 


Gly 


Gin Pro Leu Tyr Val 


Leu 


Leu 


305 










310 








315 




320 


Tyr 


Ala 


Ser 


Ala 


He 


He 


Phe 


Phe 


Cys 


Phe Phe Tyr Thr Ala 


Leu 


Val 
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330 


335 




Phe 


Asn 


Pro 


Arg 


Glu 


Thr 


Ala 


Asp 


Asn 


Leu Lys Lys Ser Gly 


Ala 


Phe 
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345 






JJ u 






Val 


Pro 


oiy 




Arg 




v nin Gin Thr Al a 
uiy uiu vjiti i hi. nx a 

*^fin 


Lys 


lyx: 
365 


T 1 O 

11c 






Val 


Met 
370 


i nr 


Axcf 


Leu 


i nr 


JbeU Val Vjiy Ala JjcU 

375 


.Tyr 
380 


He 


Thr 


trne 


lie 


Cys 


Leu 


±ie 


Pro 




rriie 


net. riX y ri&p rVXa net 


Lys 


Val 


Pro 


trne 


Tyr 


385 










390 


395 










400 


Phe Gly 


Gly 


Thr 


Ser 


Leu 


Leu lie Val Val Val 


Val 


He 


Met 


Asp 


Phe 








405 




410 








415 




Met 


Ala 


Gin 


Val 


Gin 


Thr 


Leu Met Met Ser Ser 


Gin Tyr Glu 


Ser 


Ala 








420 






425 






430 






Leu 


Lys 


Lys 
435 


Ala 


Asn 


Leu 


Lys Gly Tyr Gly Arg 
440 













<210> 315 
<211> 144 
<212> PRT 

<213> Escherichia coli 



<400> 315 






Met 


Arg 


Leu Asn Thr Leu 


Ser 


Pro 


1 




5 






Lys 


Arg 


Leu Gly Arg Gly 


lie 


Gly 






20 






Arg 


Gly 


His Lys Gly Gin 


Lys 


Ser 






35 




40 


Gly 


Phe 


Glu Gly Gly Gin 


Met 


Pro 




50 




55 




Gly 


Phe 


Thr Ser Arg Lys 


Ala 


Ala 


65 




70 






Asp 


Leu 


Ala Lys Val Glu 


Gly Gly 






85 






Ala 


Ala 


Asn lie He Gly 


He 


Gin 






100 






Ala 


Gly 


Glu Val Thr Thr 


Pro 


Val 






115 




120 


Lys 


Gly 


Ala Arg Ala Ala 


lie 


Glu 




130 




135 





Ala Glu Gly 


Ser Lys Lys 


Ala 


Gly 


10 




15 




Ser Gly Leu 


Gly Lys Thr 


Gly 


Gly 


25 


30 
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<210> 316 
<211> 59 
<212> PRT 

<213> Escherichia coli 
<400> 316 
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<212> PRT 
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<210> 318 
<211> 117 
<212> PRT 

<213> Escherichia coli 
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<210> 319 
<211> 177 
<212> PRT 

<213> Escherichia coli 
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<210> 320 
<211> 130 
<212> PRT 

<213> Escherichia coli 
<400> 320 
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<210> 321 
<211> 101 
<212> PRT 

<213> Escherichia coli 
<400> 321 
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<210> 322 
<211> 179 
<212> PRT 

<213> Escherichia coli 
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<210> 323 
<211> 104 
<212> PRT 

<213> Escherichia coli 
<400> 323 
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<210> 324 
<211> 123 
<212> PRT 
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<210> 325 
<211> 396 
<212> PRT 

<213> Escherichia coli 
<400> 325 
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<210> 326 
<211> 514 
<212> PRT 

<213> Escherichia coli 
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He 
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Arg 
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He 
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Leu 










405 




Thr 


Leu 


Tyr 
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Met 
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Leu 


Glu 






435 








Phe 
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Glu 


Leu 


Val 


Thr 
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Glu 


Val 
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465 
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Gin 
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Gly Glu 
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<210> 351 
<211> 236 
<212> PRT 

<213> Escherichia coli 
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Lys 
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Ser 
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Thr 
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He 
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Lys 
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Gly 
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Leu 


Lys 
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Glu 
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460 


Pro 
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Trp 
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Val 
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Phe 
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Val 
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Tyr 


Lys 
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He 


He 


He 
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Ala 








10 






Ala 
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Asn Gly Thr Leu Val Pro Gly Tyr Gin Pro Pro Lys Glu Met Lys Glu 

210 215 220 

Phe Leu Asp Glu His Gin Lys Met Thr Ser Gly Lys 
225 230 235 



<210> 352 
<211> 29B 
<212> PRT 

<213> Escherichia coli 



<400> 352 
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<210> 353 
<211> 246 
<212> PRT 

<213> Escherichia coli 
<400> 353 
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Met Ala Phe Ser Ser Ser Ser He 
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Trp Arg Leu Arg Ala 
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Lys 
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305 




310 




Leu Ser 


Pro 
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Ser Asp 
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Gin Met 


Leu Arg Pro Gly Val 
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Lys 
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ASp 
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405 




Thr Ser 


Val 


Gly Ala Phe Ser 


rne 
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495 




His 


Asn 


Leu 


Arg 


Phe 


Ser 


Arg 










510 






Gin 


Ser 


Leu 


Asn 


Asp 


Phe 


Gly 








525 








Lys 


Tvr 
i yx 


Tm 
11 f 


Asn 


Thr 


Ser 


Asp 






540 










Thr 


Ser 


Ser 


Tm 

IXf 


Val 


Glv 

oxy 


He 




555 










560 


Glu 


Ser 


Val 


Glv 
oxy 


lie 


Pro 


Asp 


570 










575 




Ser 


Val 


Pro 


Phe 


Asn 


Val 


Leu 










590 






Ala 


Leu 


Asp 


Arg 


Ala 


Tyr 


Ala 








605 








Gly 


Gin 


Asn 


Ser 


Trp 


Leu 


Ala 






620 










His 


Asn 


Leu 


Ser 


Tyr 


His 


Val 




635 










640 
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Ser 


Gin 


Gly 


Asp 


Thr 
645 


Ser 


Asn 


Asn 


Gly 


Tyr 
650 


Thr 


Gly 


Ser 


Ala 


Thr 
655 


Ala 


Asn 


Trp 


Gin 


Ala 
660 


Ala 


Tyr 


Gly 


Thr 


Leu 
665 


Gly 


Gly 


Gly 


Tyr 


Asn 
670 


Tyr 


Asp 


Arg 


Asp 


Gin 


His 


Asp 


Val 


Asn 


Trp 


Gin 


Leu 


Ser 


Gly 


Gly 


Val 


Val 


Gly 




675 










680 










685 








His 


Glu 
690 


Asn 


Gly 


He 


Thr 


Leu 
695 


Ser 


Gin 


Pro 


Leu 


Gly 
700 


Asp 


Thr 


Asn 


Val 


Leu 


He 


Lys 


Ala 


Pro 


Gly 


Ala 


Gly 


Gly 


Val 


Arg 


He 


Glu 


Asn 


Gin 


Thr 


705 








710 










715 










720 


Gly 


He 


Leu 


Thr 


Asp 


Trp 


Arg 


Gly 


Tyr 


Ala 


Val 


Met 


Leu 


Tyr 


Ala 


Thr 








725 










730 










735 




Val 


Tyr 


Arg 


Tyr 


Asn 


Arg 


He 


Ala 


Leu 


Asp 


Thr 


Asn 


Thr 


Met 


Gly 


Asn 






740 










745 










750 






Ser 


He 


Asp 
755 


Val 


Glu 


Lys 


Asn 


He 
760 


Ser 


Ser 


Val 


Val 


Pro 
765 


Thr 


Gin 


Gly 


Ala 


Leu 
770 


Val 


Arg 


Ala 


Asn 


Phe 
775 


Asp 


Thr 


Arg 


He 


Gly 
780 


Val 


Arg 


Ala 


Leu 


He 


Thr 


Val 


Thr 


Gin 


Gly 


Gly 


Lys 


Pro 


Val 


Pro 


Phe 


Gly 


Ser 


Leu 


Val 


785 










790 










795 










800 


Arg 


Glu 


Asn 


Ser 


Thr 


Gly 


He 


Thr 


Ser 


Met 


Val 


Gly 


Asp 


Asp 


Gly 


Gin 








805 










810 










815 




Val 


Tyr 


Leu 


Ser 
820 


Gly 


Ala 


Pro 


Leu 


Ser 
825 


Gly 


Glu 


Leu 


Leu 


Val 
830 


Gin 


Trp 


Gly 


Asp 


Gly 
835 


Ala 


Asn 


Ser 


Arg 


Cys 
840 


He 


Ala 


His 


Tyr 


Val 
845 


Leu 


Pro 


Lys 


Gin 


Ser 
850 


Leu 


Gin 


Gin 


Ala 


Val 
855 


Thr 


Val 


He 


Ser 


Ala 
860 


Val 


Cys 


Thr 


His 



Pro Gly Ser 
865 



<210> 358 
<211> 325 
<212> PRT 

<213> Escherichia coli 



<400> 358 



Met 


Ala 


Cys 


Leu 


Cys 


Leu 


Ala 


Asn 


He 


Ser 


Trp 


Ala 


Thr 


Val 


Cys 


Ala 


1 








5 










10 










15 




Asn 


Ser 


Thr 


Gly 


Val 


Ala 


Glu 


Asp 


Glu 


His 


Tyr 


Asp 


Leu 


Ser 


Asn 


lie 








20 










25 










30 






Phe 


Asn 


Ser 


Thr 


Asn 


Asn 


Gin 


Pro 


Gly 


Gin 


He 


Val 


Val 


Leu 


Pro 


Glu 






35 










40 










45 








Lys 


Ser 


Gly 


Trp 


Val 


Gly 


Val 


Ser 


Ala 


lie 


Cys 


Pro 


Pro 


Gly 


Thr 


Leu 




50 










55 










60 










Val 


Asn 


Tyr 


Thr 


Tyr 


Arg 


Ser 


Tyr 


Val 


Thr 


Asn 


Phe 


lie 


Val 


Gin 


Glu 


65 








70 










75 










80 


Thr 


He 


Asp 


Asn 


Tyr 


Lys 


Tyr 


Met 


Gin 


Leu 


His 


Asp 


Tyr 


Leu 


Leu 


Gly 










85 










90 










95 




Ala 


Met 


Ser 


Leu 


Val 


Asp 


Ser 


Val 


Met 


Asp 


He 


Gin 


Phe 


Pro 


Pro 


Gin 








100 










105 










110 






Asn 


Tyr 


He 


Arg 


Met 


Gly 


Thr 


Asp 


Pro 


Asn 


Val 


Ser 


Gin 


Asn 


Leu 


Pro 






115 










120 










125 








Phe 


Gly 


Val 


Met 


Asp 


Ser 


Arg 


Leu 


lie 


Phe 


Arg 


Leu 


Lys 


Val 


lie 


Arg 




130 










135 










140 










Pro 


Phe 


He 


Asn 


Met 


Val 


Glu 


He 


Pro 


Arg 


Gin 


Val 


Met 


Phe 


Thr 


Val 


145 










150 










155 










160 


Tyr 


Val 


Thr 


Ser 


Thr 


Pro 


Tyr 


Asp 


Pro 


Leu 


Val 


Thr 


Pro 


Val 


Tyr 


Thr 








165 










170 










175 




He 


Ser 


Phe 


Gly 


Gly 


Arg 


Val 


Glu 


Val 


Pro 


Gin 


Asn 


Cys 


Glu 


Leu 


Asn 
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180 


Ala 


Gly 


Gin 


He 






195 




Phe 


Ser 


Ala 


Ala 




210 






Thr 


Lys 


Ser 


He 


225 








Leu 


Thr 


Met 


Arg 


Ser 


Asp 


Asn 


Gin 








2oU 


Pro 


He 


Thr 


Pro 






275 




Ala 


Ala 


Ala 


Ala 




290 






Thr 


Gly 


Gin 


Lys 


305 








Arg 


Val 


Asp 


Tyr 



Val Glu 

Gly Pro 

Ala Val 
230 
Leu Glu 
245 

Asp Leu 

Asn Asp 

Ala Asn 

Pro Thr 
310 

Gin 
325 



Phe Asp 
200 
Gly Asn 
215 

Lys Cys 

Ala Ser 

Gly Phe 

Leu Asn 
280 
Val Thr 
295 

Glu Gly 



185 

Phe Gly 

Arg Pro 

Thr Asn 

Ala Val 
250 
He Val 
265 

Ser Val 
Leu Arg 
Pro Phe 



Asp He 

Ala Gly 
220 
Val Ala 
235 

Ser Gly 

Ala Asp 

He Pro 

Ala Trp 
300 
Ser Ala 
315 



190 
Gly Ala 
205 

Val Met 

Ala Gin 

Gin Ala 

Gin Asn 
270 
Phe Arg 
285 

Pro He 
Leu Gly 



Ser Leu 

Pro Gin 

Ala Tyr 
240 
Met Val 
255 

Asp Thr 

Leu Asp 

Ser He 

Tyr Leu 
320 



<210> 359 
<211> 171 
<212> PRT 

<213> Escherichia coli 



<400> 359 












Met 


Arg 


Arg 


Val 


Leu 


Phe Ser Cys 


Phe Cys Gly Leu 


Leu Trp Ser Ser 


1 




5 




10 


15 


Ser 


Gly 


Trp Ala 


Val 


Asp Pro Leu 


Gly Thr He Asn 


He Asn Leu His 






20 






25 


30 


Gly 


Asn 


Val 


Val 


Asp 


Phe Ser Cys 


Thr Val Asn Thr Ala Asp He Asp 




35 






40 




45 


Lys 


Thr 


Val 


Asp 


Leu 


Gly Arg Trp 


Pro Thr Thr Gin 


Leu Leu Asn Ala 


50 








55 


60 




Gly 


Asp 


Thr 


Thr 


Ala 


Leu Val Pro 


Phe Ser Leu Arg 


Leu Glu Gly Cys 


65 








70 


75 


80 


Pro 


Pro 


Gly 


Ser 


Val 


Ala He Leu 


Phe Thr Gly Thr 


Pro Ala Ser Asp 








85 




90 


95 


Thr 


Asn 


Leu 


Leu 


Ala 


Leu Asp Asp 


Pro Ala Met Ala 


Gin Thr Val Ala 








100 






105 


110 


He 


Glu 


Leu 


Arg 


Asn 


Ser Asp Arg Ser Arg Leu Ala Leu Gly Glu Ala 






115 






120 




125 


Ser 


Pro 


Thr 


Glu 


Glu 


Val Asp Ala Asn Gly Asn Val 


Thr Leu Asn Phe 




130 








135 


140 




Phe 


Ala 


Asn 


Tyr 


Arg 


Ala Leu Ala 


Ser Gly Val Arg 


Pro Gly Val Ala 


145 








150 


155 


160 


Lys 


Ala 


Asp Ala 


He 


Phe Met He Asn Tyr Asn 










165 




170 





<210> 360 ... 
<211> 194 
<212> PRT 

<213> Escherichia coli 
<400> 360 

Met Phe Gly Tyr Arg Ser Asn Val Pro Lys Val Arg Leu Thr Thr Asp 

15 10 15 

Arg Leu Val Val Arg Leu Val His Asp Arg Asp Ala Trp Arg Leu Ala 

20 25 30 

Asp Tyr Tyr Ala Glu Asn Arg His Phe Leu Lys Pro Trp Glu Pro Val 
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35 










40 




Arg Asp 


Glu 


Ser 


His 


Cys 


Tyr 


Pro 


Ser Gly 


50 










55 






Met He 


Asn 


Glu 


Phe 


His 


Lys 


Gin 


Gly Ser 


65 








70 








Phe Asp 


Pro 


Asp 


Glu 


Lys 


Glu 


He 


He Gly 








85 








90 


Val Val 


Arg 


Gly 


Ser 


Phe 


His 


Ala 


Cys Tyr 






100 










105 


Gin Lys 


Trp 


Gin 


Gly 


Lys 


Gly Leu 


Met Phe 




115 










120 




He Arg 


Tyr 


Met 


Gin 


Arg 


Thr 


Gin 


His He 


130 










135 






Tyr Met 


Pro 


His 


Asn 


Lys Arg 


Ser 


Gly Asp 


145 








150 








Phe Glu 


Lys 


Glu 


Gly 


Tyr Ala 


Lys 


Asp Tyr 








165 








170 


Trp Arg 


Asp 


His 


Val 


Leu 


Thr 


Ala 


Leu Thr 






180 










185 


Gly Arg 

















<210> 361 
<211> 215 
<212> PRT 

<213> Escherichia coli 



<400> 361 



Met 


Lys 


Tyr Gin 


Leu 


Thr 


Ala 


Leu 


Glu Ala 


1 






5 








10 


Leu 


Glu 


Lys Gin 


Val 


Thr 


Thr 


Pro 


Glu Gin 






20 










25 


Gly 


Val 


Val Thr 


Ala 


Cys 


Asn 


Gin 


Lys Thr 






35 








40 




Asn 


Leu 


Ser Glu 


Ser 


Glu 


Val 


Gin 


Glu Gin 




50 








55 






Arg 


His 


Tyr Leu 


Arg 


Thr 


Val 


Ser 


Gly Phe 


65 








70 








Tyr 


Glu 


Gin Arg 


Phe 


Cys 


Asn 


Ser 


Glu Phe 








85 








90 


Ala 


Ala 


Glu Val 


Ala 


Leu 


He 


Thr 


Thr Leu 






100 










105 


Thr 


Pro 


Gly Glu 


Leu Arg 


Ser 


Arg 


Ala Ala 






115 








120 




Asp 


Met 


Ala Glu 


Val 


Glu 


Ser 


Thr 


Leu Glu 


130 








135 






Asp 


Gly 


Pro Phe 


Val 


Val 


Arg 


Leu 


Ala Arg 


145 








150 








Asn 


Arg 


Tyr Met 


His 


Leu 


Phe 


Ser 


Gly Glu 








165 








170 


Val 


Thr 


Asp Met 


Ser 


Asn 


Ala 


Val 


Asp Gly 






180 










185 


Glu 


Ala 


Leu Glu 


He 


Glu 


Val 


Ala 


Glu Leu 






195 








200 




Leu 


Leu 


Ala His 


Leu Gly Asp 








210 








215 







<210> 362 
<211> 307 
<212> PRT 
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45 



Trp Gin Ala Arg 


Leu 


Gly 


60 






Ala Phe Tyr Phe 


Gly 


Leu 


75 




80 


Val Ala Asn Phe 


Ser 


Asn 




95 




Leu Gly Tyr Ser 


He 


Gly 


110 






Glu Ala Leu Thr 


Ala 


Ala 


125 






His Arg He Met 


Ala 


Asn 


140 






Leu Leu Ala Arg 


Leu 


Gly 


155 




160 


Leu Leu He Asp 


Gly 


Gin 




175 




Thr Pro Asp Trp 


Thr 


Pro 


190 






Arg Val He Gly 


Cys 


Leu 




15 




Tyr Pro Leu Ser 


Val 


Asn 


30 






Asn Arg Glu Pro 


Val 


Met 


45 






Leu Asp Asn Leu 


Val 


Lys 


60 






Gly Asn Arg Val 


Thr 


Lys 


75 




80 


Gly Asp Leu Lys 


Leu 


Ser 




95 




Leu Leu Arg Gly 


Ala 


Gin 


110 






Arg Met Tyr Glu 


Phe 


Ser 


125 






Gin Leu Ala Asn 


Arg 


Glu 


140 






Glu Pro Gly Lys 


Arg 


Glu 


155 




160 


Val Glu Asp Gin 


Pro 


Ala 




175 




Asp Leu Gin Ala 


Arg 


Val 


190 






Lys Gin Arg Leu 


Asp 


Ser 



205 
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<213> Escherichia coli 



<400> 362 



Met 


Lys 


Lys 


Leu 


Arg 


lie 


1 








5 




Lys 


Ala 


Trp 


Leu 


Pro 


vai 














Giy 


Aia 


Trp 


Cor* 

oer 


fro 


Thr 

1 IIJL 






35 








Trp 


Arg 


lie 


Pro 


Tyr 


hi a 




50 










Asp 


Ala 


val 


Phe 


val 


HIS 


65 










70 


Ser 


Thr 


Leu 


Leu 


Asn 


Ala 










85 




Ala 


Glu 


Asn 


Leu 


Arg 


Asp 








100 






Lys 


Lys 


Leu 


Thr 


Leu 


Met 






115 








Tyr 


Gly 


Glu 


Leu 


Lys 


Thr 




130 










Asp 


Lys 


His 


Arg 


Ser 


Asn 


145 










i en 

150 


T *■»» » 

Leu 


T /Ml 

Leu 


Asp 


Asp 


Tyr 


Leu 










1 c c 

165 




Gly 


Gly 


Lys 


7V 1 

Ala 


ser 


Leu 








180 






Gly 


GlU 


Met 


Leu 


fne 


Mia 














lie 


1 nr 


Thr 


Cys 


Met 


HIS 




z 1U 










Gin 


Ala 


Val 


Thr 


Asp 


Gly 


225 










230 


Trp 


Arg 


Glu 


Glu 


Arg 


Gly 










245 




Trp 


Gin 


Ser 


Thr 


Leu 


Glu 








260 






Phe 


He 


Glu 


Cys 


Val 


Gin 






275 








Gin 


Ala 


Val 


Leu 


Ala 


Gin 




290 










Met 


Ser 


Glu 









305 



<210> 363 
<211> 239 
<212> PRT 

<213> Escherichia coli 
<400> 363 

Met Leu Lys Arg Val Phe 

1 " 5 

Leu Leu Thr Val Leu Gly 
20 

Pro Tyr lie Tyr Asp Glu 
35 

Val Val Leu Gly Thr Ala 
50 

Tyr Tyr Arg Tyr Arg lie 
65 70 





VdX 


val 


m v 

oxy 


Leu 


oxy 








xu 






Leu 


H_Ld 




Ala 


oer 


Asp 
















Ala 




Ala 


Leu 


Pro 














Asp 


Ser 


Leu 


C a v 

oer 


Ser 


Leu 


3D 












oer 


C A V- 


i nr 


Aia 


oer 


HIS 










/o 




Gly 


vai 


His 


val 


Cys 


vai 








yu 






Aia 


G1U 


Arg 


Leu 


vai 


G1U 






105 








Val 


Gly 


Pne 


Asn 


Arg 


Arg 




lzO 










Gin 


Leu 


7\ 1 a 

nia 


i nr 


rVJ. a 


Ala 
nia 


i "3 c; 










14 0 


oer 


vai 


Gly 


fro 


HXS 


Asp 










13 3 




MIS 


\7a 1 


Vol 


Asp 


i nir 


aia 

AX a 








1 / u 






Asp 


vjxy 


vaJLy 


X III 


Leu 


Leu 






1DD 








IsXU 


HIS 


HIS 


fne 


Ser 


/*xa 




4t. u u 










Arg 


Arg 


Ala 


Gly 


Ser 


Gin 


215 










220 


Ala 


Leu 


He 


Asp 


He 


Thr 










235 




Gin 


Gly 


Val 


Val 


His 


Lys 








250 






Gin 


Arg 


Gly 


Phe 


Val 


Gly 






265 








Asn 


Gin 


Thr 


Val 


Pro 


Gin 




280 










Arg 


He 


Val 


Asp 


Lys 


lie 














Leu 


Ser 


Leu 


Leu 


Val 


Leu 








10 






Leu 


Asp 


Arg 


Trp 


Met 


Ser 






25 








Leu 


Gin 


Asp 


Leu 


Pro 


Tyr 




40 










Lys 


Tyr 


Tyr 


Arg 


Thr 


Gly 


55 










60 


Gin 


Gly 


Ala 


He 


Asn 

-7 5 


Ala 



75 
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uxy 


lie 


Ala 


Ulll 










Trp 


inr 


Leu 


Gin 




*an 






lie 


Cys 


G1U 


oer 


A C 

4 0 








Ala 


Ala 


Ser 


Cys 


pne 


Asp 


val 


val 








oU 


Asp 


Lys 


Pro 


Leu 










Leu 


7V 1 -* 

Ala 


TV 1 _ 

Ala 


Arg 




110 






Phe 


Ala 


Pro 


Leu 


IOC 

125 








Ser 


Leu 


Arg 


Met 


Leu 


Tyr 


fne 


inr 








lbU 


Leu 


Trp 


Leu 


Ser 






1 / D 




l nr 


Asn 


Asp 


ax a 




1 OA 

1 yu 






ijxy 


fro 


Leu 




one. 








Arg 


Glu 


Thr 


Val 


Asp 


Met 


Arg 


Glu 








240 


Pro 


He 


Pro 


Gly 










Cys 


Ala 


Arg 


His 




270 






Thr 


Ala 


Gly 


Glu 


285 








Trp 


Arg 


Asp 


Ala 


lie 


Gly 


Leu 


Leu 






15 




Trp 


Lys 


Thr 


Ala 




30 






Arg 


Gin 


Val 


Gly 


45 








Val 


lie 


Asn 


Gin 


Tyr 


Asn 


Ser 


Gly 
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Lys 


Val 


Asn Tyr 


Leu Leu 










85 


Asn 


Glu 


Pro 


Met 


Thr Met 








100 




Pro 


Ser 


Asp 


He 


Val Leu 






115 






He 


Val 


Arg 


Thr 


Arg Lys 




130 








Thr 


Gin 


Arg 


Phe 


His Cys 


145 








150 


Gly 


He 


Gin 


Ma 


Gin Cys 










165 


Ser 


Val 


Arg 


He 


Arg Glu 








180 




Leu 


Tyr 


He 


Phe 


Lys Arg 






195 






He 


Pro 


Ala 


Met 


His Gin 




210 








Val 


Thr 


Pro 


Glu 


Gin Leu 



225 230 

<210> 364 
<211> 79 
<212> PRT 

<213> Escherichia coli 
<400> 364 

Met Asp Val Gin Gin Phe 

1 5 
He Phe Cys Phe Arg Glu 
20 

Asp Lys Arg Val Lys Asn 
35 

Lys Asn Pro Gly Leu Phe 
50 

Gly He Leu Ser He Gly 
65 70 

<210> 365 
<211> 510 
<212> PRT 

<213> Escherichia coli 



<400> 365 



Met 


Ala 


Thr 


He 


Asp 


Ser 


1 








5 




Gly 


Pro 


Asp 


Trp 
20 


Thr 


Phe 


Asp 


Arg 


Val 
35 


Ala 


Lys 


Leu 


Glu 


Val 
50 


He 


Thr 


Ser 


Glu 


Met 


Pro 


He 


Asn 


Tyr 


Pro 


65 










70 


Thr 


Glu 


Arg 


Leu 


Tyr 
85 


Lys 


Val 


He 


Asn 


Ser 
100 


Asn 


Pro 


He 


Thr 


Met 


Gin 


Ala 


Leu 



115 



Leu 


Ser Gly Asp Asn 


Ala 






90 




Arg 


Lys 


Asp Leu He 


Ala 






105 




Asp 


Tyr 


Ala Gly Phe 


Arg 




120 






Val 


Phe Asp Thr Asn 


Asp 


135 






140 


Glu 


Arg Ala Leu Phe 


He 






155 




Tyr 


Ala 


Val Pro Ser 


Pro 






170 




Phe 


Ala 


Ala Arg Phe 


Gly 






185 




Glu 


Pro Arg Phe Leu 


Gly 




200 






Val 


Pro 


Glu Asp Ala 


Gin 


215 






220 


Leu 


Glu 


Leu Gin Lys 


Lys 






235 




Phe 


Val 


Val Ala Val 


Phe 






10 




Ala 


Trp 


Lys Gly Trp 


Arg 






25 




Ala 


Pro 


Glu Pro Val 


Tyr 




40 






Phe 


Ala 


Tyr Met Val 


Ala 


55 






60 


Met 


He Val Tyr Leu 


He 






75 




Met 


Asn 


Lys Asp Thr 


Thr 






10 




Asp 


Leu Leu Asp Val 


Tyr 






25 




Tyr 


Arg 


Leu Asp Thr 


Tyr 




40 






Gin 


Met 


Met Asp Ala 


Tyr 


55 






60 


His 


Trp 


Ser Phe Gly 


Lys 






75 




His 


Gly Gin Gin Gly 


Leu 






90 




Cys 


He 


Ala Tyr Leu 


Met 






105 




Val 


Met 


Ala His Ala 


Cys 




120 
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Leu 


Gin Ser 


Tyr 




95 




Ala 


Gly Val 


Asp 




110 




Thr 


Leu Asp 


Ser 


125 






Phe 


He He 


He 


Ala 


Leu His 


Met 






160 


Lys 


Asp Met 


Leu 




175 




Ala 


Leu Ala 


Asp 




190 




Pro 


Leu Val 


Pro 


205 






Gly 


Tyr Pro 


Ala 


Gin 


Gly Lys 





Phe 


Leu He 


Pro 




15 




Ala 


Gly Ala 


He 




30 




Val 


Trp Arg 


Ala 


45 






Tyr 


He Gly 


Phe 


Phe 


Tyr Arg 





Arg 


Leu 


Ser 


Asp 






15 




Leu 


Ala 


Glu 


He 




30 






Pro 


His 


Gin 


He 


45 








Ser 


Ser 


Val 


Gly 


4* 

Lys 'Phe 


He 


Glu 








80 


Ala Tyr Glu 


He 






95 




Glu 


Glu 


Asn 


Thr 




110 






Tyr 


Gly His 


Asn 


125 









-336- 



WO 01/34810 



PCT/US00/30950 



Ser 


rne 

X JU 


rne 


Lys 


Asn 


Asn 


Tyr 
135 


Leu 


rue 


Arg 


OCl 


Trn 

irp 


Thr 


Asp 


Ala 


Cor 
oex 


Ser 


Tl 0 

ixe 


Vdl 


Asp 


Tyr 


Leu 


He 


Phe 


Ala 


ZVr rr 


T i»e 
ujfo 


iyr 


T1p 
x x e 


Thr 


VjX u 




145 










150 










155 










160 


blU 


^*^ ii 
blu 


Arg 


Tyr 


biy 

lv«3 


Val 


Asp 


Glu 


Val 


Glu 
170 




Ton 


Leu 


Asp 


Ser 
175 


Cys 


HIS 


n 1 a 
t\±a. 


Leu 


He t. 


Asn 


Tyr Gly 


Val 


Asp 


Arg 


iyi 


T.v/c 
xiy o 


Arg 


Pro 


Gin 


Lys 








180 










185 










190 






lie 


ber 


Leu 


bin 


blU 


Glu 


Lys 


Ala Arg 


Gl n 

bin 


Lys 




Arg 


V3X LI 


Gl u 


iyr 






l y 0 










200 










205 








Leu 


bin 
o i n 

Z JLU 




bin 


W a 1 
Veil 


Asn 


Met 
215 


Leu 


Trp 


TV 

Arg 


Thr 
1 ill 


L€U 

990 

£. &. KJ 


tiU 


Lys 


Arg 


Gl u 

U1U 


blU 


blU 


Lys 


Th t* 

i nr 


vai 


Ala 


Glu 


Ala Arg 


Arg 


Tyr 


Pro 


Car 


fill! 


Pro 
riu 


Gin 


225 










230 










235 










240 


blU 


Asn 


Leu 


Leu 


Tyr 


Phe 


Met 


Glu 


Lys 


Asn 
9^n 

Z D \J 


nld 


Pro 


Leu 


Leu 


Gl n 

OX Li 

9SR 


■JCl 


Trp 


bin 


Arg 


m ii 

blU 


lie 


Leu Arg 


He 


Val 


Arg 


Lys 


V ct-L 


Cpr 


V3X1I 


iyr 


Php 
r tie 
















265 










970 

£. 1 K> 






Tyr 


Pro 


Gin 


Lys 


bin 


Thr 


Gin 


Val 


Met 


Asn 


gi ii 


biy 


Trp 


nld 


i nr 


□ ho 
rne 




275 










280 










285 








Trp 


liXS 

290 


Tyr 


i nr 


Tip 

i ie 


Leu 


Asn 
295 


His 


Leu 


T\/r 

iyr 


no 


300 


Gly 


Lys 


V dX 


Thr 


r i ii 
bill 


Arg 


rne 


net 


Leu 


Glu 


Phe 


Leu 


His 


Otr L 




L 11X 


Asn 


vai 


vdl 


Php 
r lie 


*5flR 










310 










315 










320 


bin 


Pro 


Pro 


Tyr 


Asn 


Ser 


Pro 


Trp Tyr 


Cpr 




T 1 P 




r x KJ 


T\/r 

i yr 


Ala 










19^ 










«n 










335 




Leu 


biy 


rne 


Hla 


wet 


Phe 


Gin 


Asp 


He 
345 


Lys 


Zii"r*t 
ni y 


lie 


fire 


Gl n 

V3X1 1 

350 


^pr 
oex 


Prn 

rx kj 


i nr 


r'l ii 
blU 


blU 

355 


Asp 


Lys 


Tyr 


Trp 


Phe 
360 


Pro 


Asp 


Tip 
lie 


Mia 


gi vr 

ljx y 

365 


oex 




irp 


Leu 


r*i ii 

bill 


Th y 

i nr 


Leu 


Hie 
ni o 


Phe 


Ala 


Met Arg 


Asp 


Phe 


Lys 


Asp 


Glu 


Ser 


Phe 




ITfl 

O / u 










375 










180 










Tin 


Ser 


bin 


IT IlC 


Leu 


Ser 


Pro 


Lys 


Val 


Meat* 
lie L. 


ni y 


no ^> 


Php 
r lie 


ZVrrr 
rvx y 


Php 

r lie 


Phe 


385 










390 










395 










400 


Th r> 

i nx 


Val 


Leo 


Asp 


Asp 


Asp Arg 


His 


Asn 


lyi 


Leu 


Glu 


He 


Ser 


Ala 


He 










405 










4 10 










415 




HIS 


Asn 


Gl ii 


gi ii 
420 


rzi w 
uiy 


Tyr 


Arg 


Glu 


He 
425 


Arg 


Asn 


Arg 


Leu 


Ser 
430 


Ser 


Gin 


Tyr 


Asn 


Leu 


Ser 


Asn 


Leu 


Glu 


Pro 


Asn 


Tip 

lie 


m n 


T 1 p 
lie 


irp 


Asn 


val 


t\j KJ 




435 










440 










445 








Leu 


Arg 
450 


biy 


Asp 


Arg 


Ser 


Leu 
455 


Thr 


Leu 


7A rn 
rti y 


T\/r 

iyr 


x x e 

460 


r x t/ 


niv> 


Asn 


nx y 


Ala 


Pro 


Leu 


Asp 


Arg 


Gly Arg 


Lys 


Glu 


Val 


Leu 


Lys 


His 


Val 


His 


Arg 


465 










470 










475 










480 


Leu 


Trp 


Gly 


Phe 


Asp 
485 


Val 


Met 


Leu 


Glu 


Gin 
490 


Gin 


Asn 


Glu 


Asp 


Gly 
495 


Ser 


He 


Glu 


Leu 


Leu 
500 


Glu 


Arg 


Cys 


Pro 


Pro 
505 


Arg 


Met 


Gly 


Asn 


Leu 
510 
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<210> 367 
<211> 239 
<212> PRT 

<213> Escherichia coli 
<400> 367 
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<212> PRT 

<213> Escherichia coli 
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<211> 274 
<212> PRT 

<213> Escherichia coli 
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<212> PRT 
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<210> 371 
<211> 113 
<212> PRT 

<213> Escherichia coii 
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<210> 372 
<211> 162 
<212> PRT 

•<213> Escherichia coli 
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<212> PRT 
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Glu 


He 


Leu 


Asp 
40 


Gly 


Gly 


Thr 


Ala 


Gly 
45 


Met 


Glu 


Leu 


Leu 


Gly 
50 


Asp 


Met 


Ala 


Asn 


Arg 
55 


Asp 


His 


Leu 


He 


He 
60 


Ala 


Asp 


Ala 


He 


Val 


Ser 


Lys 


Lys 


Asn 


Ala 


Pro 


Gly 


Thr 


Met 


Met 


He 


Leu 


Arg 


Asp 


Glu 


65 










70 










75 










80 


Glu 


Val 


Pro 


Ala 


Leu 
85 


Phe 


Thr 


Asn 


Lys 


He 
90 


Ser 


Pro 


His 


Gin 


Leu 
95 


Gly 


Leu 


Ala 


Asp 


Val 
100 


Leu 


Ser 


Ala 


Leu 


Arg 
105 


Phe 


Thr 


Gly 


Glu 


Phe 
110 


Pro 


Lys 


Lys 


Leu 


Thr 
115 


Leu 


Val 


Gly 


Val 


He 
120 


Pro 


Glu 


Ser 


Leu 


Glu 
125 


Pro 


His 


He 


Gly 


Leu 


Thr 


Pro 


Thr 


Val 


Glu 


Ala 


Met 


He 


Glu 


Pro 


Ala 


Leu 


Glu 


Gin 


130 










135 










140 










Val 


Leu 


Ala 


Ala 


Leu 


Arg 


Glu 


Ser 


Gly 


Val 


Glu 


Ala 


He 


Pro 


Arg 


Glu 


145 










150 










155 










160 


Ala 


He 


His 


Asp 



























<210> 374 
<211> 567 
<212> PRT 

<213> Escherichia coli 



<400> 374 



Met 


Ser 


Gin 


Arg 


He 


Thr 


He 


Asp 


Pro 


Val 


Thr 


Arg 


He 


Glu 


Gly 


His 


] 








5 










10 










15 




Leu 


Arg 


He 


Asp 


Cys 


Glu 


He 


Glu 


Asn 


Gly 


Val 


Val 


Ser 


Lys 


Ala 


Trp 








20 










25 










30 






Ala 


Ser 


Gly 


Thr 


Met 


Trp 


Arg 


Gly 


Met 


Glu 


Glu 


He 


Val 


Lys 


Asn 


Arg 






35 










40 










45 








Asp 


Pro 


Arg 


Asp 


Ala 


Trp 


Met 


He 


Val 


Gin 


Arg 


He 


Cys 


Gly 


Val 


Cys 




50 










55 










60 










Thr 


Thr 


Thr 


His 


Ala 


Leu 


Ser 


Ser 


Val 


Arg 


Ala 


Ala 


Glu 


Ser 


Ala 


Leu 


65 










70 










75 










80 


Asn 


He 


Asp 


Val 


Pro 


Val 


Asn 


Ala 


Gin 


Tyr 


He 


Arg 


Asn 


He 


He 


Leu 










85 










90 










95 




Ala 


Ala 


His 


Thr 


Thr 


His 


Asp 


His 


He 


Val 


His 


Phe 


Tyr 


Gin 


Leu 


Ser 








100 










105 










110 






Ala 


Leu 


Asp 


Trp 


Val 


Asp 


He 


Thr 


Ser 


Ala 


Leu 


Gin 


Ala 


Asp 


Pro 


Thr 






115 










120 










125 








Lys 


Ala 


Ser 


Glu 


Met 


Leu 


Lys 


Gly 


Val 


Ser 


Thr 


Trp 


His 


Leu 


Asn 


Ser 




130 










135 










140 










Pro 


Glu 


Glu 


Phe 


Thr 


Lys 


Val 


Gin 


Asn 


Lys 


He 


Lys 


Asp 


Leu 


Val 


Ala 


145 










150 










155 










160 


Ser 


Gly 


Gin 


Leu 


Gly 


He 


Phe 


Ala 


Asn 


Gly 


Tyr 


Trp 


Gly 


His 


Pro 


Ala 










165 










170 










175 




Met 


Lys 


Leu 


Pro 


Pro 


Glu 


Val 


Asn 


Leu 


He 


Ala 


Val 


Ala 


His 


Tyr 


Leu 








180 










185 










190 






Gin 


Ala 


Leu 


Glu 


Cys 


Gin 


Arg 


Asp 


Ala 


Asn 


Arg 


Val 


Val 


Ala 


Leu 


Leu 






195 










200 










205 








Gly 


Gly 


Lys 


Thr 


Pro 


His 


He 


Gin 


Asn 


Leu 


Ala 


Val 


Gly 


Gly 


Val 


Ala 




210 










215 










220 










Asn 


Pro 


He 


Asn 


Leu 


Asp 


Gly 


Leu 


Gly 


Val 


Leu 


Asn 


Leu 


Glu 


Arg 


Leu 


225 










230 










235 










240 


Met 


Tyr 


He 


Lys 


Ser 


Phe 


He 


Asp 


Lys 


Leu 


Ser 


Asp 


Phe 


Val 


Glu 


Gin 
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245 



Val Tyr 


Lys 


Val Asp 


Thr 


Ala 


vdl 






260 








Leu Thr 


Arg 


Gly Lys 


Gly Ala 


vai 




275 








d. oU 


Pro Thr 


Asp 


Ser Lys 


Asn 


Gly 


Car 

oer 


290 








295 




Glu Asn 


Ala 


Asp Leu 


Ser 


Ser 


Tyr 


305 






310 






Glu Tyr 


Leu 


lie Lys 


Gly 


He 


Gin 




325 








Lys Asp 


Glu 


Ala Pro 


Gin 


Ala 


Pro 






340 








Tyr Asp 


Gly 


Trp Ser 


Asp Asp 


Gly 




355 








360 


Thr Phe 


Tyr 


Gly Lys 


Thr 


Val 


Glu 


370 








375 




Val Lys 


Leu 


Ala Ala 


Gly Arg 


Glu 


385 






390 






lie Val 


Ala 


lie Tyr 


Gin 


Lys 


Leu 






405 








Gin Leu 


His 


Ser Thr 


Leu 


Gly 


Arg 






420 








Cys Glu 


Leu 


Gin Asp 


lie 


Leu 


G±n 


435 








a a n 
4 4 U 


Asn lie 


Gly 


Lys Gly 


Asp 


His 


inr 


450 








455 




Ala Thr 


Gly 


Glu Phe 


Lys 


Gly 


vai 


465 






470 






Met Leu 


Ser 


His Trp 


Met 


Val 


lie 






485 








Gin Ala 


Val 


Val Pro 


Ser 


Thr 


Trp 






500 








Asp Asp 


Val 


Gly Pro 


Tyr 


Glu 


Gin 




515 








520 


Asp Pro 


Asn 


Lys Pro 


Leu 


Glu 


Val 


530 








535 




Pro Cys 


Met 


Ala Cys 


Ala 


Val 


His 


545 






550 






Val Val 


Ser 


Val Lys 


Val 


Leu 








565 









<210> 375 
<211> 392 
<212> PRT 

<213> Escherichia coli 
<400> 375 



Met Ser 


His 


Asp 


Pro 


Gin Pro 


Leu 


1 






5 






Val Met 


He 


Phe 


Gly 


Pro Leu 


He 






20 








Lys Arg 


Leu 


val 


Phe 


Gly Leu 


Gly 




35 








40 


Phe Pro 


Trp 


Gly 


Val 


Trp He 


Ala 


50 








55 




Phe Ala 


Cys 


Gly 


Gly 


Trp Ala 


Leu 


65 








70 




Arg Gly 


Gin 


Tyr 


His 


Pro Leu 


Val 








85 







o tin 








255 




T I ft 7\ 1 _ Til — 

lie Ala Ala 


Phe Tyr 


Pro 


Glu 


Trp 








270 






Asn Tyr Leu 


Ser 


Val 


Pro 


Glu 


rfie 






285 








Phe Leu Phe 


Pro Gly Gly Tyr 


Tin 

lie 




300 










Arg Pro He 


Thr 


Ser 


His 


Ser 


Asp 


315 












Glu Ser Ala 


Lys 


His 


Ser 


Trp 


Tyr 


330 








335 




Trp Glu Gly 


Thr 


Thr 


He 


Pro 


Ala 


345 






350 






Lys Tyr Ser 


Trp 


Val 


Lys 


Ser 


Pro 






365 








Val Gly Pro 


Leu 


Ala 


Asn 


Met 


Leu 




380 










Ser Thr Gin 


Asn 


Lys 


Leu 


Asn 


Glu 


395 










4UU 


Thr Gly Asn 


Thr 


Leu 


Glu 


Val 


Ala 


4 1U 








4 Id 




He He Gly 


Arg 


Thr 


Val 


His 


Cys 


/IOC 






4 30 






Asn Gin Tyr 


Ser 


Ala 


Leu 


He 


i nr 






445 








Inr trne vai 


Lys 


Pro 


Asn 


He 


Pro 




460 










Cjiy Fne Leu 


Glu 


Ala 


Pro 


Arg 


u±y 


H 1 D 










a u n 


Lys Asp Gly 


He 


He 


Ser 


Asn 


Tyr 


a on 
4 yu 








495 




Asn Ser Gly 


Pro Arg Asn 


Phe 


Asn 


505 






510 






Ser Leu Val 


Gly 


Thr 


Pro 


Val 


Ala 






525 








Val Arg Thr 


He 


His 


Ser 


Phe 


Asp 




540 










Val Val Asp 


Ala Asp Gly Asn 


Glu 


555 










560 



Gly Gly Lys 


He He Ser Lys 


Pro 


10 


15 




Val He Cys 


Met Leu Leu He 


Val 


25 


30 




Ser Val Ser 


Asp Leu Asn Gly 


Gly 




45 




Phe Asp Leu 


Leu He Gly Thr 


Gly 




60 




Ala Trp Ala 


Val Tyr Val Phe 


Asn 


75 




80 


Arg Pre Ala 


Leu Leu Ala Ser 


Leu 


90 


95 
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Phe Gly Tyr Ser Leu Gly Gly Leu Ser lie Thr lie Asp Val Gly Arg 

100 105 110 

Tyr Trp Asn Leu Pro Tyr Phe Tyr lie Pro Gly His Phe Asn Val Asn 

115 120 125 

Ser Val Leu Phe Glu Thr Ala Val Cys Met Thr He Tyr He Gly Val 

130 135 140 

Met Ala Leu Glu Phe Ala Pro Ala Leu Phe Glu Arg Leu Gly Trp Lys 
145 150 155 160 

Val Ser Leu Gin Arg Leu Asn Lys Val Met Phe Phe He He Ala Leu 

165 170 175 

Gly Ala Leu Leu Pro Thr Met His Gin Ser Ser Met Gly Ser Leu Met 

180 185 190 

He Ser Ala Gly Tyr Lys Val His Pro Leu Trp Gin Ser Tyr Glu Met 

195 " 200 205 

Leu Pro Leu Phe Ser Leu Leu Thr Ala Phe He Met Gly Phe Ser He 

210 215 220 

Val He Phe Glu Gly Ser Leu Val Gin Ala Gly Leu Arg Gly Asn Gly 
225 230 235 240 

Pro Asp Glu Lys Ser Leu Phe Val Lys Leu Thr Asn Thr lie Ser Val 

245 250 255 

Leu Leu Ala lie Phe He Val Leu Arg Phe Gly Glu Leu lie Tyr Arg 

260 265 270 

Asp Lys Leu Ser Leu Ala Phe Ala Gly Asp Phe Tyr Ser Val Met Phe 

275 280 285 

Trp He Glu Val Leu Leu Met Leu Phe Pro Leu Val Val Leu Arg Val 

290 295 300 

Ala Lys Leu Arg Asn Asp Ser Arg Met Leu Phe Leu Ser Ala Leu Ser 
305 310 315 320 

Ala Leu Leu Gly Cys Ala Thr Trp Arg Leu Thr Tyr Ser Leu Val Ala 

325 330 335 

Phe Asn Pro Gly Gly Gly Tyr Ala Tyr Phe Pro Thr Trp Glu Glu Leu 

340 345 350 

Leu lie Ser lie Gly Phe Val Ala He Glu He Cys Ala Tyr lie Val 

355 " 360 365 

Leu lie Arg Leu Leu Pro He Leu Pro Pro Leu Lys Gin Asn Asp His 

370 375 380 

Asn Arg His Glu Ala Ser Lys Ala 
385 390 

<210> 376 
<211> 328 
<212> PRT 

<213> Escherichia coli 
<400> 376 

Met Asn Arg Arg Asn Phe lie Lys Ala Ala Ser Cys Gly Ala Leu Leu 

1 5 10 - 15 

Thr Gly Ala Leu Pro Ser Val Ser His Ala Ala Ala Glu Asn Arg Pro 

20 25 30 

Pro lie Pro Gly Ser Leu Gly Met Leu Tyr Asp Ser Thr Leu Cys Val 

35 40 45 

Gly Cys Gin Ala Cys Val Thr Lys Cys Gin Asp He Asn Phe Pro Glu 

50 55 60 

Arg Asn Pro Gin Gly Glu Gin Thr Trp Ser Asn Asn Asp Lys Leu Ser 
65 70 75 80 

Pro Tyr Thr Asn Asn lie lie Gin Val Trp Thr Ser Gly Thr Gly Val 

85 90 95 

Asn Lys Asp Gin Glu Glu Asn Gly Tyr Ala Tyr He Lys Lys Gin Cys 

100 105 HO 

Met His Cys Val Asp Pro Asn Cys Val Ser Val Cys Pro Val Ser Ala 
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115 






1 ?CL 

LCO 


Leu 


Lys 


Lys Asp 


_ w ^ m\ ^ ^ t T „ IT—I ii ' _ 

Pro Lys Thr Gly He Val His 


Tyr Asp Lys Asp Val 




130 








1 a n 
14U 


Cys 


Thr 


Gly Cys 


Arg Tyr Cys Met Val Ala Cys 


fro iyr /vsn vai fro 


145 








i c n ice 

15Q lbb 


lOU 


Lys 


Tyr 


Asp Tyr 


Asn Asn Pro Phe Gly Ala Leu 


His Lys Cys Glu Leu 






165 1/U 


i fO 


Cys 


Asn 


Gin Lys 


Gly Val Glu Arg Leu Asp Lys 


Gly Gly Leu Pro Gly 






180 


loo 


i on 

iyu 


Cys 


Val 


Glu 


Val 


Cys Pro Ala Gly Ala Val lie 


Phe Gly Thr Arg Glu 




195 




inn 
200 


one 


Glu 


Leu 


Met 


Ala 


Glu Ala Lys Lys Arg Leu Ala 


Leu Lys Pro Gly Ser 




210 






215 




Glu 


Tyr 


His 


Tyr 


Pro Arg Gin Thr Leu Lys Ser 


Gly Asp Thr Tyr Leu 


225 






230 235 


IAD 


His 


Thr 


Val 


Pro 


Lys Tyr Tyr Pro His Leu Tyr 


Gly Glu Lys Glu Gly 










245 250 




Gly 


Gly 


Thr 


Gin 


Val Leu Val Leu Thr Gly Val 


Pro Tyr Glu Asn Leu 




260 


265 


270 


Asp 


Leu 


Pro 


Lys 


Leu Asp Asp Leu Ser Thr Gly 


Ala Arg Ser Glu Asn 




275 




280 


285 


He 


Gin 


His 


Thr 


Leu Tyr Lys Gly Met Met Leu 


Pro Leu Ala Val Leu 




290 






295 


300 


Ala 


Gly 


Leu 


Thr 


Val Leu Val Arg Arg Asn Thr 


Lys Asn Asp His His 


305 






310 315 


320 


Asp 


Gly 


Gly Asp 


Asp His Glu Ser 





325 



<210> 377 
<211> 824 
<212> PRT 

<213> Escherichia coli 



<400> 377 



Met Leu 


Gin 


Cys 


Gly 


Ala 


Lys Asn Val Asn 


Pro 


Leu Glu Arg Phe Val 


1 






5 




10 




15 


Ser Ser 


Leu 


Pro 


Val 


Ala 


Ala Val Leu Pro 


Glu 


Leu Leu Thr Ala Leu 






20 






25 




30 


Asp Cys 


Ala 


Pro 


Gin 


Val 


Leu Leu Ser Ala 


Pro 


Thr Gly Ala Gly Lys 


35 








40 




45 


Ser Thr 


Trp 


Leu 


Pro 


Leu 


Gin Leu Leu Ala 


His 


Pro Gly He Asn Gly 


50 








55 




60 


Lys He 


lie 


Leu 


Leu 


Glu 


Pro Arg Arg Leu 


Ala 


Ala Arg Asn Val Ala 


65 








70 




75 


80 


Gin Arg 


Leu 


Ala 


Glu 


Leu 


Leu Asn Glu Lys 


Pro 


Gly Asp Thr Val Gly 






85 




90 




95 


Tyr Arg 


Met 


Arg 


Ala 


Gin Asn Cys Val Gly 


Pro 


Asn Thr Arg Leu Glu 




100 






105 




110 


Val Val 


Thr 


Glu 


Gly 


Val 


Leu Thr Arg Met 


He 


Gin Arg Asp Pro Glu 




115 






120 • 




125 


Leu Ser 


Gly 


Val 


Gly 


Leu 


Val He Leu Asp 


Glu 


Phe His Glu Arg Ser 


130 








135 




140 


Leu Gin 


Ala 


Asp 


Leu 


Ala 


Leu Ala Leu Leu 


Leu 


Asp Val Gin Gin Gly 


145 






150 




155 


160 


Leu Arg 


Asp 


Asp 


Leu 


Lys 


Leu Leu He Met 


Ser 


Ala Thr Leu Asp Asn 


165 




170 




175 


Asp Arg 


Leu 


Gin 


Gin 


Met 


Leu Pro Glu Ala 


Pro 


Val Val He Ser Glu 




180 






185 




190 


Gly Arg 


Ser 


Phe 


Pro 


Val Glu Arg Arg Tyr Leu 


Pro Leu Pro Ala His 


195 








200 




205 
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Gin Arg 


Phe 


Asp 


Asp 


Ala 


Val 


Ala 


Val 


Ala 


Thr 


Ala 


Glu 


Met 


Leu 


Arg 


210 










215 










220 










Gin Glu 


Ser 


Gly 


Ser 


Leu 


Leu 


Leu 


Phe 


Leu 


Pro 


Gly 


Val 


Gly 


Glu 


lie 


225 








230 










235 










240 


Gin Arg 


Val 


Gin 


Glu 


Gin 


Leu 


Ala 


Ser 


Arg 


He 


Gly 


Ser 


Asp 


Val 


Leu 






245 










250 










255 




Leu Cys 


Pro 


Leu 
260 


Tyr 


Gly 


Ala 


Leu 


Ser 
265 


Leu 


Asn 


Asp 


Gin 


Arg 
270 


Lys 


Ala 


lie Leu 


Pro 
275 


Ala 


Pro 


Gin 


Gly 


Met 
280 


Arg 


Lys 


Val 


Val 


Leu 
285 


Ala 


Thr 


Asn 


lie Ala 


Glu 


Thr 


Ser 


Leu 


Thr 


He 


Glu 


Gly 


He 


Arg 


Leu 


Val 


Val 


Asp 


290 










295 










300 










Cys Ala 


Gin 


Glu 


Arg 


Val 


Ala 


Arg 


Phe 


Asp 


Pro 


Arg 


Thr 


Gly 


Leu 


Thr 


305 








310 










315 










320 


Arg Leu 


He 


Thr 


Gin 


Arg 


Val 


Ser 


Gin 


Ala 


Ser 


Met 


Thr 


Gin 


Arg 


Ala 






325 










330 










335 




Gly Arg 


Ala 


Gly 
340 


Arg 


Leu 


Glu 


Pro 


Gly 
345 


He 


Ser 


Leu 


His 


Leu 
350 


He 


Ala 


Lys Glu 


Gin 


Ala 


Glu 


Arg 


Ala 


Ala 


Ala 


Gin 


Ser 


Glu 


Pro 


Glu 


He 


Leu 


355 










360 










365, 








Gin Ser 


Asp 


Leu 


Ser 


Gly 


Leu 


Leu 


Met 


Glu 


Leu 


Leu 


Gin *Trp 


Gly 


Cys 


370 










375 










380 










Ser Asp 


Pro 


Ala 


Gin 


Met 


Ser 


Trp 


Leu 


Asp Gin 


Pro 


Pro 


Val 


Val 


Asn 


385 








390 










395 










400 


Leu Leu 


Ala 


Ala 


Lys 
405 


Arg 


Leu 


Leu 


Gin 


Met 
410 


Leu 


Gly 


Ala 


Leu 


Glu 
415 


Gly 


Glu Arg 


Leu 


Ser 
420 


Ala 


Gin 


Gly 


Gin 


Lys 
425 


Met 


Ala 


Ala 


Leu 


Gly 
430 


Asn 


Asp 


Pro Arg 


Leu 
435 


Ala 


Ala 


Met 


Leu 


Val 
440 


Ser 


Ala 


Lys 


Asn 


Asp 
445 


Asp 


Glu 


Ala 


Ala Thr 


Ala 


Ala 


Lys 


He 


Ala 


Ala 


He 


Leu 


Glu 


Glu 


Pro 


Pro 


Arg 


Met 


450 








455 










4 60 










Gly Asn 


Ser 


Asp 


Leu 


Gly 


Val 


Ala 


Phe 


Ser Arg 


Asn 


Gin 


Pro 


Ala 


Trp 


465 








470 










475 










480 


Gin Gin 


Arg 


Ser 


Gin 
485 


Gin 


Leu 


Leu 


Lys 


Arg 
490 


Leu 


Asn 


Val 


Arg 


Gly 
495 


Gly 


Glu Ala 


Asp 


Ser 


Ser 


Leu 


He 


Ala 


Pro 


Leu 


Leu 


Ala 


Gly 


Ala 


Phe 


Ala 




500 










505 










510 






Asp Arg 


He 


Ala 


Arg 


Arg 


Arg 


Gly 


Gin 


Asp Gly 


Arg 


Tyr 


Gin 


Leu 


Ala 




515 










520 










525 








Asn Gly 


Met 


Gly 


Ala 


Met 


Leu 


Asp 


Ala 


Asn Asp 


Ala 


Leu 


Ser 


Arg 


His 


530 










535 










540 










Glu Trp 


Leu 


lie 


Ala 


Pro 


Leu 


Leu 


Leu 


Gin Gly 


Ser 


Ala 


Ser 


Pro 


Asp 


545 








550 










555 










560 


Ala Arg 


He 


Leu 


Leu 


Ala 


Leu 


Leu 


Val 


Asp 


He 


Asp 


Glu 


Leu 


Val 


Gin 






565 










570 










575 




Arg Cys 


Pro 


Gin 


Leu 


Val 


Gin 


Gin 


Ser 


Asp Thr 


Val 


Glu 


Trp 


Asp 


Asp 






580 










585 










590 






Ala Gin 


Gly 
595 


Thr 


Leu 


Lys 


Ala 


Trp 
600 


Arg 


Arg 


Leu 


Gin 


He 
605 


Gly 


Gin 


Leu 


Thr Val 


Lys 


Val 


Gin 


Pro 


Leu 


Ala 


Lys 


Pro 


Ser 


Glu 


Asp 


Glu 


Leu 


His 


610 










615 










620 










Gin Ala 


Met 


Leu 


Asn 


Gly 


He 


Arg 


Asp 


Lys Gly 


Leu 


Ser 


Val 


Leu 


Asn 


625 








630 










635 










640 


Trp Thr 


Ala 


Glu 


Ala 


Glu 


Gin 


Leu 


Arg 


Leu Arg 


Leu 


Leu 


Cys 


Ala 


Ala 






645 










650 










655 




Lys Trp 


Leu 


Pro 
660 


Glu 


Tyr 


Asp 


Trp 


Pro 
665 


Ala 


Val 


Asp 


Asp 


Glu 
670 


Ser 


Leu 


Leu Ala 


Ala 
675 


Leu 


Glu 


Thr 


Trp 


Leu 
680 


Leu 


Pro 


His 


Met 


Thr 
685 


Gly 


Val 


His 
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Ser 


Leu 


Arg 


Gly 


Leu 


Lys Ser Leu Asp He Tyr Gin Ala Leu Arg Gly 




690 








695 700 










Leu 


Leu 


Asp 


Trp 


Gly 


Met Gin Gin Arg Leu Asp Ser 


Glu 


Leu 


Pro 


Ala 


705 










710 715 








720 


His 


Tyr 


Thr 


Val 


Pro 


Thr Gly Ser Arg He Ala He 


Arg 


Tyr 


His 


Glu 








725 


730 






735 




Asp 


Asn 


Pro 


Pro 


Ala 


Leu Ala Val Arg Met Gin Glu 


Met 


Phe Gly Glu 






740 




745 




750 






Ala 


Thr 


Asn 
755 


Pro 


Thr 


He Ala Gin Gly Arg Val Pro 
760 


Leu 
765 


Val 


Leu 


Glu 


Leu 


Leu 
770 


Ser 


Pro 


Ala 


Gin Arg Pro Leu Gin He Thr 
775 780 


Arg 


Asp 


Leu 


Ser 


Asp 


Phe 


Trp 


Lys 


Gly 


Ala Tyr Arg Glu Val Gin Lys 


Glu 


Met 


Lys 


Gly 


785 








790 795 








800 


Arg 


Tyr 


Pro 


Lys 


His 


Val Trp Pro Asp Asp Pro Ala 


Asn 


Thr 


Ala 


Pro 






805 


810 






815 




Thr 


Arg 


Arg 


Thr 
820 


Lys 


Lys Tyr Ser 











<210> 378 
<211> 316 
<212> PRT 

<213> Escherichia coli 
<400> 378 



Met 


Ser 


Lys 


Leu 


Asp 


Thr Phe lie Gin His 


Ala 


Val 


Asn 


Ala Val 


Pro 


1 






5 


10 








15 




Val 


Ser 


Gly 


Thr 


Ser 


Leu He Ser Ser Leu 


Tyr 


Gly Asp 


Ser Leu 


Ser 






20 




25 








30 




His 


Arg 


Gly 


Gly 


Glu 


He Trp Leu Gly Ser 


Leu 


Ala 


Ala 


Leu Leu 


Glu 




35 






40 






45 






Gly 


Leu 


Gly 


Phe 


Gly 


Glu Arg Phe Val Arg 


Thr 


Ala 


Leu 


Phe Arg 


Leu 


50 








55 




60 








Asn 


Lys 


Glu 


Gly 


Trp 


Leu Asp Val Ser Arg 


He Gly Arg Arg Ser 


Phe 


65 








70 


75 








80 


Tyr 


Ser 


Leu 


Ser 


Asp 


Lys Gly Leu Arg Leu Thr Arg Arg Ala Glu 


Ser 








85 


90 








95 




Lys 


He 


Tyr 


Arg 


Ala 


Glu Gin Pro Ala Trp Asp Gly Lys Trp Leu Leu 




100 




105 








110 




Leu 


Leu 


Ser 


Glu 


Gly 


Leu Asp Lys Ser Thr 


Leu 


Ala 


Asp Val Lys 


Lys 






115 






120 






125 






Gin 


Leu 


He 


Trp 


Gin 


Gly Phe Gly Ala Leu 


Ala 


Pro 


Ser 


Leu Met 


Ala 




130 








135 




140 








Ser 


Pro 


Ser 


Gin 


Lys 


Leu Ala Asp Val Gin 


Thr 


Leu 


Leu 


His Glu 


Ala 


145 










150 


155 








160 


Gly 


Val 


Ala 


Asp 


Asn 


Val He Cys Phe Glu 


Ala 


Gin 


He 


Pro Leu 


Ala 








165 


170 








175 




Leu 


Ser 


Arg 


Ala 


Ala 


Leu Arg Ala Arg Val 


Glu 


Glu 


Cys 


Trp His 


Leu 






180 




185 








190 




Thr 


Glu 


Gin 


Asn 


Ala 


Met Tyr Glu Thr Phe 


He 


Gin 


Ser 


Phe Arg 


Pro 






195 






200 






205 






Leu 


Val 


Pro 


Leu 


Leu 


Lys Glu Ala Ala Asp 


Glu 


Leu 


Thr 


Pro Glu Arg 




210 








215 




220 








Ala 


Phe 


His 


He 


Gin 


Leu Leu Leu He His 


Phe Tyr Arg Arg Val Val 


225 










230 


235 








240 


Leu 


Lys 


Asp 


Pro 


Leu 


Leu Pro Glu Glu Leu 


Leu 


Pro 


Ala 


His Trp 


Ala 








245 


250 








255 




Gly 


His 


Thr 


Ala 


Arg 


Gin Leu Cys He Asn 


He 


Tyr 


Gin 


Arg Val 


Ala 






260 




265 








270 




Pro 


Ala 


Ala 


Leu 


Ala 


Phe Val Ser Glu Lys 


Gly Glu Thr 


Ser Val 


Gly 
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275 280 
Glu Leu Pro Ala Pro Gly Ser Leu 

290 295 
Asn lie Glu Gin Glu Ala Leu Cys 
305 310 

<210> 379 
<211> 196 
<212> PRT 

<213> Escherichia coli 



<400> 379 



Met 


Pro 


lie 


Tyr 


Gin 


He 


Asp Gly 


1 








5 






Ser 


Phe 


Val 


His 


Pro 


Thr 


Ala Val 








20 








Lys 


Gly 


Val 


Tyr 


Val 


Gly 


Pro Asn 






35 








40 


Arg 


lie 


Val 


Val 


Lys 


Asp 


Gly Ala 




50 










55 


His 


Gly 


Phe 


Pro 


Glu 


Gin 


Asp Thr 


65 










70 




Gly 


His 


Ser 


Ala 


He 


Leu 


His Gly 










85 






Val 


Gly 


Met 


Asn 


Ala 


Val 


Val Met 








100 








Ser 


He 


Val 


Gly 


Ala 


Ser 


Ala Phe 






115 








120 


Ala 


Asn 


Tyr 


Leu 


He 


Val 


Gly Ser 




130 










135 


Ser 


Glu 


Gin 


Glu 


Leu 


Ala 


Trp Lys 


145 










150 




Val 


Leu 


Val 


Thr 


Arg 


Cys 


Lys Gin 










165 






Arg 


Glu 


He 


Glu 


Pro 


Gly 


Arg Lys 








180 








Arg 


Pro 


Lys 


Gin 









195 

<210> 380 
<211> 426 
<212> PRT 

<213> Escherichia coli 



<400> 380 



Met 


Lys 


His 


Leu 


Thr 


Glu 


Met 


Val 


1 








5 








Asn 


Gly 


lie 


Tyr 


Ala 


Val 


Cys 


Ser 








20 










Ala 


He 


Arg 


Tyr 


Ala 


Ser 


Ala 


Asn 






35 










40 


Thr 


Ser 


Asn 


Gin 


Val 


Asp 


Gin 


Phe 




50 










55 




Ala 


Asp 


Phe 


Arg 


Gly 


Phe 


Val Cys 


65 










70 






Pro 


Gin 


Asp 


Ala 


Leu 


He 


Leu 


Gly 










85 








Trp 


Gin 


Asn 


Leu 


Pro 


Ala 


Ala 


Gin 








100 










He 


Lys 


Ser 


Tyr 


Val 


Ala 


Ala Gly 



285 



Tyr 


Phe 


Gln 


Arg Phe Gly 


Gly Leu 








300 




Gin 


Phe 


He 


Arg 








315 






Leu 


Thr 


Pro 


Val Val Pro 


Glu Glu 




10 






15 


Leu 


He Gly 


Asp Val lie 


Leu Gly 


25 






30 




Ala 


Ser 


Leu 


Arg Gly Asp 


Phe Gly 








45 




Asn 


He 


Gin 


Asp Asn Cys 


Val Met 








60 




Val 


Val 


Gly 


Glu Asp Gly 


His lie 






75 




80 


Cys 


lie 


lie 


Arg Arg Asn 


Ala Leu 




90 






95 


Asp 


Gly Ala 


Val Tie Gly 


Glu Asn 


105 






110 




Val 


Lys 


Ala 


Lys Ala Glu 


Met Pro 








125 




Pro 


Ala 


Lys 


Ala lie Arg 


Glu Leu 








140 




Lys 


Gin Gly 


Thr His Glu 


Tyr Gin 






155 




160 


Thr 


Leu 


His 


Gin Val Glu 


Pro Leu 




170 






175 


Arg 


Leu 


Val 


Phe Asp Glu 


Asn Leu 


185 






190 




Arg 


Gin 


His 


Lys Ala Gly 


Lys Thr 




10 






15 


Ala 


His 


Pro 


Leu Val Leu 


Glu Ala 


25 






30 




Gin 


Thr 


Pro 


Leu Leu lie 


Glu Ala 








45 




Gly 


Gly 


Tyr 


Thr Gly Met 


Thr Pro 








60 




Gin 


Leu 


Ala 


Asp Ser Leu 


Asn Phe 






75 




80 


Gly 


Asp 


His 


Leu Gly Pro 


Asn Arg 




90 






95 


Ala 


Met 


Ala 


Asn Ala Asp 


Asp Leu 


105 






110 




Phe 


Lys 


Lys 


lie His Leu 


Asp Cys 
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115 



120 



125 



Ser Met Ser Cys Gin Asp Asp Pro He Pro Leu Thr Asp Asp He Val 

130 " 135 140 

Ala Glu Arg Ala Ala Arg Leu Ala Lys Val Ala Glu Glu Thr Cys Leu 
145 " 150 155 160 

Glu His Phe Gly Glu Ala Asp Leu Glu Tyr Val He Gly Thr Glu Val 

165 170 175 

Pro Val Pro Gly Gly Ala His Glu Thr Leu Ser Glu Leu Ala Val Thr 

180 185 190 

Thr Pro Asp Ala Ala Arg Ala Thr Leu Glu Ala His Arg His Ala Phe 

195 200 205 

Glu Lys Gin Gly Leu Asn Ala He Trp Pro Arg He He Ala Leu Val 

210 215 220 

Val Gin Pro Gly Val Glu Phe Asp His Thr Asn Val He Asp Tyr Gin 
225 230 235 240 

Pro Ala Lys Ala Ser Ala Leu Ser Gin Met Val Glu Asn Tyr Glu Thr 

245 250 255 

Leu He Phe Glu Ala His Ser Thr Asp Tyr Gin Thr Pro Gin Ser Leu 

260 265 270. 

Arg Gin Leu Val He Asp His Phe Ala He Leu Lys Val Gly Pro Ala 

275 280 285 

Leu Thr Phe Ala Leu Arg Glu Ala Leu Phe Ser Leu Ala Ala He Glu 

290 295 300 

•Glu Glu Leu Val Pro Ala Lys Ala Cys Ser Gly Leu Arg Gin Val Leu 
305 310 315 320 

Glu Asp Val Met Leu Asp Arg Pro Glu Tyr Trp Gin Ser His Tyr His 

325 330 335 

Gly Asp Gly Asn Ala Arg Arg Leu Ala Arg Gly Tyr Ser Tyr Ser Asp 

340 345 350 

Arg Val Arg Tyr Tyr Trp Pro Asp Ser Gin He Asp Asp Ala Phe Ala 

355 " . 360 365 

His Leu Val Arg Asn Leu Ala Asp Ser Pro He Pro Leu Pro Leu He 

370 375 380 

Ser Gin Tyr Leu Pro Leu Gin Tyr Val Lys Val Arg Ser Gly Glu Leu 
385 ' 390 395 400 

Gin Pro Thr Pro Arg Glu Leu He He Asn His He Gin Asp He Leu 

405 410 415 

Ala Gin Tyr His Thr Ala Cys Glu Gly Gin 



<210> 381 
<211> 169 
<212> PRT 

<213> Escherichia coli 
<400> 381 



Met 


Lys 


Ala 


Asn 


Lys 


Gin Asn Lys Glu Glu His Ala Met 


Pro 


Asn He 


1 






5 


10 




15 


Val 


Leu 


Ser 


Arg 


He 


Asp Glu Arg Leu He His Gly Gin Val Gly Val 








20 




25 


30 




Gin 


Trp 


Val 


Gly 


Phe 


Ala Gly Ala Asn Leu Val Leu Val 


Ala 


Asn Asp 




35 




40 .45 






Glu 


Val 
50 


Ala 


Glu 


Asp 


Pro Val Gin Gin Asn Leu Met Glu 
55 60 


Met 


Val Leu 


Ala 


Glu 


Gly 


He 


Ala 


Val Arg Phe Trp Thr Leu Gin Lys 


Val 


He Asp 


65 








70 75 




80 


Asn 


lie 


His 


Arg 


Ala 


Ala Asp Arg Gin Lys He Leu Leu 


Val 


Cys Lys 








85 


90 




95 


Thr 


Pro 


Ala 


Asp 


Phe 


Leu Thr Leu Val Lys Gly Gly Val 


Pro 


Val Asn 



420 



425 



100 



105 



110 
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Arg He Asn Val Gly Asn 
115 

Ala Lys Thr Val Ser Val 
130 

Leu Lys Thr Ala Gly Val 
145 150 
Pro Ala Val Asp Leu Phe 
165 

<210> 382 
<211> 133 
<212> PRT 

<213> Escherichia coli 
<400> 382 



Met 


Glu 


He 


Ser 


Leu 


Leu 


1 








5 




He 


Ala 


Gly 


Leu 
20 


Asp 


Met 


Val 


Val 


Leu 
35 


Gly 


Pro 


Leu 


Gly 


He 
50 


Leu 


Thr 


Gly 


Gly 


Pro 


Leu 


Ala 


Gly 


Ala 


Gin 


65 










70 


Gly 


Thr 


Ala 


Phe 


Ala 
85 


lie 


Gly 


Val 


Ala 


Val 
100 


Pro 


Phe 


Leu 


Phe 


Ser 
115 


Val 


Met 


Ser 


Asn 


Pro 


Arg 


Arg 


He 





130 



<210> 383 
<211> 167 
<212> PRT 

<213> Escherichia coli 



<400> 383 



Met 


His 


Cys 


Tyr 


Asn 


Gly 


1 








5 




Met 


Val 


Gly 


Ala 
20 


Gly 


Leu 


Ala 


Asp 


Gly 
35 


His 


His 


Val 


Cys 


Ala 
50 


Lys 


Glu 


Arg 


lie 


Gly 


Met 


Pro 


Asp 


Gly 


Arg 


65 










70 


His 


Gly 


Gly 


Val 


Val 
85 


Arg 


Leu 


Ser 


Val 


Asp 
100 


Ala 


Ala 


Thr 


Pro 


Ala 
115 


Glu 


Ala 


lie 


Leu 


Gly 
130 


Val 


Asp 


Gly 


Val 


Arg 


Val 


Val 


Ala 


Leu 


Asp 


145 










150 



Met 


His 


Tyr 


Ala 


Asn 


Gly 




120 










Asp 


Ala 


Gly 


Asp 


He 


Ala 


135 










140 


Glu 


Cys 


Phe 


Val 


Gin 


Gly 










155 




Lys 


Leu 


Leu 








Gin 


Ala 


Phe 


Ala 


Leu 


Gly 








10 






Phe 


Asn 


Gly 


Leu 


Thr 


His 






25 








Val 


Gly 


Leu 


Val 


Leu 


Gly 




40 










Thr 


Leu 


Glu 


Leu 


Val 


Trp 


55 










60 


Pro 


Pro 


Asn 


Val 


lie 


lie 










75 




Thr 


Thr 


Gly 


Val 


Lys 


Pro 








90 






Ala 


Val 


Ala 


vox 


Gin 


Met 






105 








Gly 


Val 


Met 


Ser 


Arg 


Cys 




120 










Met 


Thr 


Gly 


Leu 


His 


His 








10 






Thr 


Asp 


Lys 


Arg 


Ala 


Trp 






25 








His 


Pro 


Ala 


Ala 


Met 


Ser 




40 










Val 


Leu 


He 


Thr 


Asp 


Ala 


55 










60 


Tyr 


Thr 


Leu 


Cys 


Gly 


Glu 










75 




Thr 


Ala 


Ser 


Gly 


Gly 


Leu 








90 






Val 


Arg 


Asn 


Met 


Val 


Glu 






105 








His 


Met 


Ala 


Ser 


Leu 


His 




120 










Leu 


Gly 


Ser 


Leu 


Lys 


Pro 


135 










140 


Ser 


Gly 


Leu 


His 


Val 


Gin 



155 



PCTAJS00/30950 



Lys 


Gin 


Gin 


lie 


125 








Ala 


Phe 


Asn 


Asp 


val 


Pro 


Thr 


Glu 








160 


lie 


lie 


Ala 


Phe 






15 




Met 


His 


Arg 


Pro 




30 






Asp 


Leu 


His 


Thr 


45 








Met 


Gly 


Leu 


Ala 


Gly 


Thr 


lie 


Val 








80 


Asp 


Val 


Ala 


Val 






95 




Gly 


lie 


Thr 


Phe 




110 






Asp 


Leu 


Ala 


Thr 


125 








Arg 


Glu 


Pro 


Gly 






15 




Leu 


Glu 


Leu 


lie 




30 






Leu 


Cys 


Cys 


Cys 


45 








Met 


Gin 


Ala 


Ala 


Glu 


Val 


Gin 


Met 








80 


Ala 


Gly 


Ser 


Thr 






95 




Leu 


Thr 


Gly 


Val 




110 






Pro 


Ala 


Arg 


Met 


125 








Gly 


Lys 


Arg 


Ala 


Gin 


lie 


Trp 


He 



160 
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Gin Gly Gin Leu Ala Ser Phe 
165 

<210> 384 
<211> 384 
<212> PRT 

<213> Escherichia coli 
<400> 384 

Met Pro Glu Asn Tyr Thr Pro Ala Ala Ala Ala Thr Gly Thr Trp Thr 

15 10 15 

Glu Glu Glu He Arg His Gin Pro Arg Ala Trp He Arg Ser Leu Thr 

20 25 30 

Asn He Asp Ala Leu Arg Ser Ala Leu Asn Asn Phe Leu Glu Pro Leu 

35 40 45 

Leu Arg Lys Glu Asn Leu Arg He He Leu Thr Gly Ala Gly Thr Ser 

50 55 60 

Ala Phe He Gly Asp He He Ala Pro Trp Leu Ala Ser His Thr Gly 
65 70 75 80 

Lys Asn Phe Ser Ala Val Pro Thr Thr Asp Leu Val Thr Asn Pro Met 

85 90 95 

Asp Tyr Leu Asn Pro Ala His Pro Leu Leu Leu He Ser Phe Gly Arg 

100 105 HO 

Ser Gly Asn Ser Pro Glu Ser Val Ala Ala Val Glu Leu Ala Asn Gin 

115 120 125 

Phe Val Pro Glu Cys Tyr His Leu Pro He Thr Cys Asn Glu Ala Gly 

130 135 140 

Ala Leu Tyr Gin Asn Ala He Asn Ser Asp Asn Ala Phe Ala Leu Leu 
145 150 155 160 

Met Pro Ala Glu Thr His Asp Arg Gly Phe Ala Met Thr Ser Ser He 

165 170 175 

Thr Thr Met Met Ala Ser Cys Leu Ala Val Phe Ala Pro Glu Thr He 

180 185 190 

Asn Ser Gin Thr Phe Arg Asp Val Ala Asp Arg Cys Gin Ala lie Leu 

195 200 205 

Thr Ser Leu Gly Asp Phe Ser Glu Gly Val Phe Gly Tyr Ala Pro Trp 

210 ~ 215 220 

Lys Arg He Val Tyr Leu Gly Ser Gly Gly Leu Gin Gly Ala Ala Arg 
225 230 235 240 

Glu Ser Ala Leu Lys Val Leu Glu Leu Thr Ala Gly Lys Leu Ala Ala 

245 250 255 

Phe Tyr Asp Ser Pro Thr Gly Phe Arg His Gly Pro Lys Ser Leu Val 

260 265 270 

Asp Asp Glu Thr Leu Val Val Val Phe Val Ser Ser His Pro Tyr Thr 

275 280 285 

Arg Gin Tyr Asp Leu Asp Leu Leu Ala Glu Leu Arg Arg Asp Asn Gin 

290 ' 295 300 

Ala Met Arg Val He Ala He Ala Ala Glu Ser Ser Asp He Val Ala 
305 " 310 315 320 

Ala Gly Pro His He He Leu Pro Pro Ser Arg His Phe He Asp Val 

325 330 335 

Glu Gin Ala Phe Cys Phe Leu Met Tyr Ala Gin Thr Phe Ala Leu Met 

340 345 350 

Gin Ser Leu His Met Gly Asn Thr Pro Asp Thr Pro Ser Ala Ser Gly 

355 ~ • 360 365 

Thr Val Asn Arg Val Val Gin Gly Val He He His Pro Trp Gin Ala 
370 " 375 380 

<210> 385 
<211> 286 
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<212> PRT 

<213> Escherichia coli 



<400> 385 



Met 


Ser 


He 


He 


Ser 


Thr 


Lys 


Tyr 


Leu 


Leu Gin Asp Ala 


Gin 


Ala 


Asn 


1 








5 










10 






15 




Gly 


Tyr 


Ala 


Val 


Pro 


Ala 


Phe 


Asn 


He 


His Asn Ala 


Glu 


Thr 


He 


Gin 




20 










25 






30 






Ala 


He 


Leu 


Glu 


Val 


Cys 


Ser 


Glu 


Met 


Arg Ser Pro 


Val 


lie 


Leu 


Ala 






35 










40 






45 








Gly 


Thr 


Pro 


Gly 


Thr 


Phe 


Lys 


His 


He 


Ala Leu Glu 


Glu 


lie 


Tyr 


Ala 


50 










55 






60 










Leu 


Cys 


Ser 


Ala 


Tyr 


Ser 


Thr 


Thr 


Tyr 


Asn Met Pro 


Leu 


Ala 


Leu 


His 


65 








70 








75 








80 


Leu 


Asp 


His 


His 


Glu 


Ser 


Leu 


Asp 


Asp 


lie Arg Arg Lys 


Val 


His 


Ala 








85 










90 






95 




Gly 


Val 


Arg 


Ser 


Ala 


Met 


lie 


Asp 


Gly 


Ser His Phe 


Pro 


Phe 


Ala 


Glu 






100 










105 






110 






Asn 


Val 


Lys 


Leu 


Val 


Lys 


Ser 


Val 


Val 


Asp Phe Cys 


His 


Ser 


Gin 


Asp 






115 










120 






125 








Cys 


Ser 


Val 


Glu 


Ala 


Glu 


Leu 


Gly 


Arg 


Leu Gly Gly Val 


Glu Asp 


Asp 


130 










135 






140 










Met 


Ser 


Val 


Asp 


Ala 


Glu 


Ser 


Ala 


Phe 


Leu Thr Asp 


Pro 


Gin 


Glu 


Ala 


145 








150 








155 








160 


Lys 


Arg 


Phe 


Val 


Glu 


Leu 


Thr 


Gly 


Val 


Asp Ser Leu 


Ala 


Val 


Ala 


He. 






165 










170 






175 




Gly 


Thr 


Ala 


His 


Gly 


Leu Tyr 


Ser 


Lys 


Thr Pro Lys 


lie 


Asp 


Phe 


Gin 






180 










185 






190 






Arg 


Leu 


H-ia 


ulu 


Tip 


Arg 


Glu 


Val 


Val 


Asp Val Pro 


Leu 


Val 


Leu 


His 




195 










200 






205 








Gly 


Ala 


Ser 


Asp 


Val 


Pro Asp 


Glu 


Phe 


Val Arg Arg 


Thr 


He 


Glu 


Leu 


210 










215 






220 










Gly 


Val 


Thr 


Lys 


Val 


Asn 


Val 


Ala 


Thr 


Glu Leu Lys 


He 


Ala 


Phe 


Ala 


225 










230 








235 








240 


Gly 


Ala 


Val 


Lys 


Ala 


Trp 


Phe 


Ala 


Glu 


Asn Pro Gin 


Gly 


Asn 


Asp 


Pro 








245 










250 






255 




Arg 


Tyr 


Tyr 


Met 


Arg 


Val Gly 


Met 


Asp 


Ala Met Lys 


Glu 


Val 


Val 


Arg 






260 










265 






270 






Asn 


Lys 


He 


Asn 


Val 


Cys Gly 


Ser 


Ala 


Asn Arg He 


Ser 


Ala 








275 










280 






285 








<210> 386 
























<211> 118 
























<212> PRT 
























<213> Escherichia coli 


















<400> 386 
























Met 


Ala 


Arg 


He 


Ala 


Gly He 


Asn 


He 


Pro Asp His 


Lys 


His 


Ala 


Val 


1 






5 










10 






15 




He 


Ala 


Leu 


Thr 


Ser 


He Tyr 


Gly 


Val 


Gly Lys Thr Arg 


Ser 


Lys 


Ala 








20 










25 






30 






He 


Leu 


Ala 


Ala 


Ala 


Gly He 


Ala 


Glu 


Asp Val Lys 


lie 


Ser 


Glu 


Leu 






35 










40 






45 








Ser 


Glu 


Gly 


Gin 


He 


Asp Thr 


Leu 


Arg 


Asp Glu Val Ala 


Lys 


Phe 


Val 




50 










55 






60 










Val 


Glu 


Gly 


Asp 


Leu 


Arg Arg 


Glu 


lie 


Ser Met Ser 


lie 


Lys 


Arg 


Leu 


65 










70 








75 








80 


Met 


Asp 


Leu 


Gly 


Cys 


Tyr Arg 


Gly 


Leu 


Arg His Arg Arg 


Gly Leu 


Pro 










85 










90 






95 




Val 


Arg 


Gly 


Gin 


Arg 


Thr Lys 


Thr 


Asn 


Ala Arg Thr Arg 


Lys Gly Pro 
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100 105 110 

Arg Lys Pro lie Lys Lys 
115 

<210> 387 
<211> 129 
<212> PRT 

<213> Escherichia coli 



<400> 387 



Met 


Ala 


Lys 


Ala 


Pro 


He 


Arg Ala 


Arg 


Lys Arg Val Arg 


Lys 


Gin 


Val 


1 








5 








10 




15 




Ser 


Asp 


Gly 


Val 


Ala 


His 


He His 


Ala 


Ser Phe Asn Asn 


Thr 


He 


Val 








20 








25 




30 






Thr 


He 


Thr 


Asp 


Arg 


Gin 


Gly Asn 


Ala 


Leu Gly Trp Ala 


Thr 


Ala 


Gly 






35 








40 




45 








Gly 


Ser 


Gly 


Phe 


Arg 


Gly 


Ser Arg 


Lys 


Ser Thr Pro Phe 


Ala 


Ala 


Gin 




50 










55 




60 








Val 


Ala 


Ala 


Glu 


Arg 


Cys 


Ala Asp 


Ala 


Val Lys Glu Tyr 


Gly 


He 


Lys 


65 










70 






75 






80 


Asn 


Leu 


Glu 


Val 


Met 


Val 


Lys Gly 


Pro 


Gly Pro Gly Arg 


Glu 


Ser 


Thr 










85 








90 




95 




lie 


Arg 


Ala 


Leu 


Asn 


Ala 


Ala Gly 


Phe 


Arg lie Thr Asn 


He 


Thr 


Asp 








100 








105 




110 






Val 


Thr 


Pro 


He 


Pro 


His 


Asn Gly 


Cys 


Arg Pro Pro Lys 


Lys 


Arg 


Arg 






115 








120 


125 









Val 



<210> 388 
<211> 206 
<212> PRT 

<213> Escherichia coli 
<400> 388 



Met 


Ala 


Arg 


Tyr 


Leu 


Gly 


Pro Lys 


Leu 


Lys Leu Ser Arg 


Arg 


Glu 


Gly 


1 








5 








10 




15 




Thr 


Asp 


Leu 


Phe 


Leu 


Lys 


Ser Gly 


Val 


Arg Ala He Asp 


Thr 


Lys 


Cys 








20 








25 




30 






Lys 


He 


Glu 


Gin 


Ala 


Pro 


Gly Gin 


His 


Gly Ala Arg Lys 


Pro 


Arg 


Leu 




35 








40 




45 








Ser 


Asp 


Tyr 


Gly 


Val 


Gin 


Leu Arg 


Glu 


Lys Gin Lys Val 


Arg 


Arg 


He 




50 










55 




60 








Tyr 


Gly 


Val 


Leu 


Glu 


Arg 


Gin Phe 


Arg 


Asn Tyr Tyr Lys 


Glu 


Ala 


Ala 


65 










70 






75 






80 


Arg 


Leu 


Lys 


Gly 


Asn 


Thr 


Gly Glu 


Asn 


Leu Leu Ala Leu 


Leu 


Glu 


Gly 










85 








90 




95 




Arg 


Leu 


Asp 


Asn 


Val 


Val 


Tyr Arg 


Met 


Gly Phe Gly Ala 


Thr 


Arg 


Ala 








100 








105 




110 






Glu 


Ala 


Arg 


Gin 


Leu 


Val 


Ser His 


Lys 


Ala He Met Val 


Asn 


Gly 


Arg 






115 








120 




125 








Val 


Val 


Asn 


He 


Ala 


Ser 


Tyr Gin 


Val 


Ser Pro Asn Asp 


Val 


Val 


Ser 




130 










135 




140 








He 


Arg 


Glu 


Lys 


Ala 


Lys 


Lys Gin 


Ser 


Arg Val Lys Ala 


Ala 


Leu 


Glu 


145 










150 






155 






160 


Leu 


Ala 


Glu 


Gin 


Arg 


Glu 


Lys Pro 


Thr 


Trp Leu Glu Val 


Asp 


Ala 


Gly 










165 








170 




175 




Lys 


Met 


Glu 


Gly 


Thr 


Phe 


Lys Arg 


Lys 


Pro Glu Arg Ser 


Asp 


Leu 


Ser 








180 








185 




190 






Ala 


Asp 


He 


Asn 


Glu 


His 


Leu He 


Val 


Glu Leu Tyr Ser 


Lys 
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195 200 205 

<210> 389 
<211> 329 
<212> PRT 

<213> Escherichia coli 



<400> 389 



Met 


Gin 


Gly 


Ser 


Val 


Thr 


Glu 


Phe 


Leu 


Lys 


Pro 


Arg 


Leu 


Val 


Asp 


He 


1 






5 










10 










15 




Glu 


Gin 


Val 


Ser 
20 


Ser 


Thr 


His 


Ala 


Lys 
25 


Val 


Thr 


Leu 


Glu 


Pro 
30 


Leu 


Glu 


Arg 


Gly 


Phe 


Gly His 


Thr 


Leu 


Gly 


Asn 


Ala 


Leu 


Arg 


Arg 


He 


Leu 


Leu 


35 










40 










45 








Ser 


Ser 


Met 


Pro Gly 


Cys 


Ala 


Val 


Thr 


Glu 


Val 


Glu 


He 


Asp 


Gly 


Val 




50 










55 










60 










Leu 


His 


Glu 


Tyr 


Ser 


Thr 


Lys 


Glu 


Gly 


Val 


Gin 


Glu 


Asp 


He 


Leu 


Glu 


65 










70 










75 










80 


He 


Leu 


Leu 


Asn 


Leu 
85 


Lys 


Gly 


Leu 


Ala 


Val 
90 


Arg 


Val 


Gin 


Gly 


Lys 
95 


Asp 


Glu 


Val 


He 


Leu 
100 


Thr 


Leu 


Asn 


Lys 


Ser 
105 


Gly 


He 


Gly 


Pro 


Val 
110 


Thr 


Ala 


Ala 


Asp 


He 


Thr 


His 


Asp 


Gly Asp 


Val 


Glu 


He 


Val 


Lys 


Pro 


Gin 


His 




115 










120 










125 








Val 


He 
130 


Cys 


His 


Leu 


Thr 


Asp 
135 


Glu 


Asn 


Ala 


Ser 


He 
140 


Ser 


Met 


Arg 


He 


Lys 


Val 


Gin 


Arg Gly 


Arg 


Gly Tyr 


Val 


Pro 


Ala 


Ser 


Thr 


Arg 


He 


His 


145 










150 










155 










160 


Ser 


Glu 


Glu 


Asp Glu 


Arg 


Pro 


He 


Gly 


Arg 


Leu 


Leu 


Val 


Asp 


Ala 


Cys 










165 










170 










175 




Tyr 


Ser 


Pro 


Val 
180 


Glu 


Arg 


He 


Ala 


Tyr 
185 


Asn 


Val 


Glu 


Ala 


Ala 
190 


Arg 


Val 


Glu 


Gin 


Arg 
195 


Thr 


Asp 


Leu 


Asp 


Lys 
200 


Leu 


Val 


He 


Glu 


Met 
205 


Glu 


Thr 


Asn 


Gly 


Thr 
210 


He 


Asp 


Pro 


Glu 


Glu 
215 


Ala 


He 


Arg 


Arg 


Ala 
220 


Ala 


Thr 


lie 


Leu 


Ala 


Glu 


Gin 


Leu 


Glu 


Ala 


Phe 


Val 


Asp 


Leu 


Arg 


Asp 


Val 


Arg 


Gin 


Pro 


225 










230 










235 










240 


Glu 


Val 


Lys 


Glu 


Glu 


Lys 


Pro 


Glu 


Phe 


Asp 


Pro 


He 


Leu 


Leu 


Arg 


Pro 








245 










250 










255 




Val 


Asp 


Asp 


Leu 


Glu 


Leu 


Thr 


Val 


Arg 


Ser 


Ala 


Asn 


Cys 


Leu 


Lys 


Ala 






260 










265 










270 






Glu 


Ala 


He 


His 


Tyr 


He 


Gly Asp 


Leu 


Val 


Gin 


Arg 


Thr 


Glu 


Val 


Glu 






275 










280 










285 








Leu 


Leu 


Lys 


Thr 


Pro 


Asn 


Leu Gly 


Lys 


Lys 


Ser 


Leu 


Thr 


Glu 


He 


Lys 




290 










295 










300 










Asp 


Val 


Leu 


Ala 


Ser 


Arg 


Gly Leu 


Ser 


Leu 


Gly 


Met 


Arg 


Leu 


Glu 


Asn 


305 










310 










315 










320 


Trp 


Pro 


Pro 


Ala 


Ser 


He 


Ala 


Asp 


Glu 

















325 



<210> 390 
<211> 127 
<212> PRT 

<213> Escherichia coli 
<400> 390 

Met Arg His Arg Lys Ser Gly Arg Gin Leu Asn Arg Asn Ser Ser His 

15 10 15 

Arg Gin Ala Met Phe Arg Asn Met Ala Gly Ser Leu Val Arg His Glu 
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zu 








30 




lie 


lie 


Lys 


Thr 


Thr 


Leu Pro Lys 


Ala 


Liys bill LicU nig niy vax 


VOX 






35 










4 R 




Glu 


Pro 


Leu 


lie 


Thr 


Leu Ala Lys 


i nr 


ASp oer vdl Hid noil nxy 


Arg 












RR 




fin 




Lieu 




trne 


rlla 


Arg 


HlX riXy nop 


Asn 


Glu Tie Val Ala Lvs Leu 


Phe 


65 








70 




75 


80 


Asn 


Glu 


Leu 


Gly 


Pro 


Arg Phe Ala 


Ser 


Arg Ala Gly Gly Tyr Thr 


Arg 










85 






90 95 




lie 


Leu 


Lys 


Cys 


Gly 


Phe Arg Ala 


Gly 


Asp Asn Ala Pro Met Ala 


Tyr 






100 






105 


110 




lie 


Glu 


Leu 


Val 


Asp 


Arg Ser Glu 


Lys 


Ala Glu Ala Ala Ala Glu 








115 






120 




125 





<210> 391 
<211> 243 
<212> PRT 

<213> Escherichia coli 
<400> 391 



Met 


Lys 


Met 


Leu 


Arg 


Asp 


Pro 


Leu 


Phe 


Trp 


Leu He Ala Leu Phe 


Ma 1 

v ax 


1 






5 










10 


15 




Ala 


Leu 


He 


pne 


Trp 


Leu 


Pro 


Tyr 


Ser 


Gin 


Pro Leu Phe Ala Ala 


Leu 


















25 




30 




Phe 


Pro 


Gin 


Leu 


Pro 


Arg 


Pro 


Val 


Tyr 


Gin 


Gin Glu Ser Phe Ala 


Ala 






35 










40 






45 




Leu 


Ala 


Leu 


Ala 


His 


Phe 


Trp 


Leu 


Val 


Gly 


He Ser Ser Leu Phe 


Ala 




50 










55 








60 




Val 


He 


He 


Gly 


Thr 


Gly Ala Gly 


He 


Ala 


Val Thr Arg Pro Trp 


Gly 


65 










70 










75 


80 


Ala 


Glu 


Phe 


Arg 


Pro 


Leu 


Val 


Glu 


Thr 


He Ala Ala Val Gly Gin 


Thr 








85 










90 


95 




Phe 


Pro 


Pro 


Val 


Ala 


Val 


Leu 


Ala 


He 


Ala 


Val Pro Val He Gly 


Phe 








100 










105 




110 




Gly Leu 


Gin 


Pro 


Ala 


He 


He 


Ala 


Leu 


He 


Leu Tyr Gly Val Leu 


Pro 






115 










120 






125 




Val 


Leu 


Gin 


Ala 


Thr 


Leu 


Ala 


Gly Leu 


Gly Ala He Asp Ala Ser 


Val 




130 










135 








140 




Thr 


Glu 


Val 


Ala 


Lys 


Gly Met 


Gly Met 


Ser Arg Gly Gin Arg Val 


Arg 


145 










150 










155 


160 


Lys 


Val 


Glu 


Leu 


Pro 


Leu 


Ala 


Ala 


Pro 


Val 


He Leu Ala Gly Val 


Arg 








165 










170 


175 




Thr 


Ser 


Val 


He 


He 


Asn 


He 


Gly Thr 


Ala 


Thr He Ala Ser Thr 


Val 








180 










185 




190 




Gly Ala 


Ser 


Thr 


Leu 


Gly Thr 


Pro 


He 


He 


He Gly Leu Ser Gly 


Phe 






195 










200 






205 




Asn 


Thr 


Ala 


Tyr 


Val 


He 


Gin Gly Ala 


Leu 


Leu Val Ala Leu Ala 


Ala 




210 








215 








220 




He 


He 


Ala 


Asp 


Arg 


Leu 


Phe 


Glu 


Arg 


Leu 


Val Gin Ala Leu Ser 


Gin 


225 








230 










235 


240 


His 


Ala 


Lys 





















<210> 392 
<211> 308 
<212> PRT 

<213> Escherichia coli 
<400> 392 

Met He Glu Phe Ser His Val Ser Lys Leu Phe Gly Ala Gin Lys Ala 
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1 



5 



10 



15 



Val Asn Asp Leu Asn Leu Asn Phe Gin Glu Gly Ser Phe Ser Val Leu 

20 25 30 

lie Gly Thr Ser Gly Ser Gly Lys Ser Thr Thr Leu Lys Met lie Asn 

35 40 45 

Arg Leu Val Glu His Asp Ser Gly Glu He Arg Phe Ala Gly Glu Glu 

50 55 60 

He Arg Ser Leu Pro Val Leu Glu Leu Arg Arg Arg Met Gly Tyr Ala 
65 70 75 80 

He Gin Ser He Gly Leu Phe Pro His Trp Ser Val Ala Gin Asn He 

85 90 95 

Ala Thr Val Pro Gin Leu Gin Lys Trp Ser Arg Ala Arg He Asp Asp 

100 105 HO 

Arg He Asp Glu Leu Met Ala Leu Leu Gly Leu Glu Ser Asn Leu Arg 

115 120 125 

Glu Arg Tyr Pro His Gin Leu Ser Gly Gly Gin Gin Gin "Arg Val Gly 

130 135 140 

Val Ala Arg Ala Leu Ala Ala Asp Pro Gin Val Leu Leu Met Asp Glu 
145 " 150 155 160 

Pro Phe Gly Ala Leu Asp Pro Val Thr Arg Gly Ala Leu Gin Gin Glu 

165 170 175 

Met Thr Arg He His Arg Leu Leu Gly Arg Thr He Val Leu Val Thr 

180 185 190 

His Asp He Asp Glu Ala Leu Arg Leu Ala Glu His Leu Val Leu Met 

195 200 205. 

Asp His Gly Glu Val Val Gin Gin Gly Asn Pro Leu Thr Met Leu Thr 

210 * 215 220 

Arg Pro Ala Asn Asp Phe Val Arg Gin Phe Phe Gly Arg Ser Glu Leu 
225 230 235 240 

Gly Val Arg Leu Leu Ser Leu Arg Ser Val Ala Asp Tyr Val Arg Arg 

245 250 255 

Glu Glu Arg Ala Asp Gly Glu Ala Leu Ala Glu Glu Met Thr Leu Arg 

260 265 270 

Asp Ala Leu Ser Leu Phe Val Ala Arg Gly Cys Glu Val Leu Pro Val 

275 280 285 

Val Asn Met Gin Gly Gin Pro Cys Gly Thr Leu His Phe Gin Asp Leu 

290 295 300 

Leu Val Glu Ala 
305 

<210> 393 
<211> 385 
<212> PRT 

<213> Escherichia coli 
<400> 393 

Met Thr Tyr Phe Arg He Asn Pro Val Leu Ala Leu Leu Leu Leu Leu 

1 5 10 15 

Thr Ala He Ala Ala Ala Leu Pro Phe He Ser Tyr Ala Pro Asn Arg 

20 25 30 

Leu Val Ser Gly Glu Gly Arg His Leu Trp Gin Leu Trp Pro Gin Thr 

35 40 45 

He Trp Met Leu Val Gly Val Gly Cys Ala Trp Leu Thr Ala Cys Phe 

50 55 60 

He Pro Gly Lys Lys Gly Ser He Cys Ala Leu He Leu Ala Gin Phe 
65 70 75 80 

Val Phe Val Leu Leu Val Trp Gly Ala Gly Lys Ala Ala Thr Gin Leu 

85 90 95 

Ala Gin Asn Gly Ser Ala Leu Ala Arg Thr Ser Leu Gly Ser Gly Phe 



100 



105 



110 
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Trp Leu Ala Ala Ala Leu Ala Leu Leu Ala Cys Ser Asp Ala lie Arg 

115 120 125 

Arg He Ser Thr His Pro Leu Trp Arg Trp Leu Leu His Met Gin He 

130 135 140 

Ala He He Pro Leu Trp Leu Leu Tyr Ser Gly Thr Leu Asn Asp Leu 
145 150 155 160 

Ser Leu Met Lys Glu Tyr Ala Asn Arg Gin Asp Val Phe Asp Asp Ala 

165 170 175 

Leu Ala Gin His Leu Thr Leu Leu Phe Gly Ala Val Leu Pro Ala Leu 

180 185 190 

Val He Gly Val Pro Leu Gly He Trp Cys Tyr Phe Ser Thr Ala Arg 

195 200 205 

Gin Gly Ala He Phe Ser Leu Leu Asn Val He Gin Thr Val Pro Ser 

210 215 220 

Val Ala Leu Phe Gly Leu Leu He Ala Pro Leu Ala Ala Leu Val Thr 
225 230 235 240 

Ala Phe Pro Trp Leu Gly Thr Leu Gly He Ala Gly Thr Gly Met Thr 

245 250 255 

Pro Ala Leu lie Ala Leu Val Leu Tyr Ala Leu Leu Pro Leu Val Arg 

260 265 . 270 

Gly Val Val Val Gly Leu Asn Gin He Pro Arg Asp Val Leu Glu Ser 

275 280 285 

Ala Arg Ala Met Gly Met Ser Gly Ala Gin Arg Phe Leu His Val Gin 

290 .295 300 

Leu Pro Leu Ala Leu Pro Val Phe Leu Arg Ser Leu Arg Val Val Met 
305 310 315 320 

Val Gin Thr Val Gly Met Ala Val He Ala Ala Leu He Gly Ala Gly 

325 330 335 

Gly Phe Gly Ala Leu Val Phe Gin Gly Leu Leu Ser Ser Ala lie Asp 

340 345 350 

Leu Val Leu Leu Gly Val He Pro Val He Val Leu Ala Val Leu Thr 

355 360 365 

Asp Ala Leu Phe Asp Leu Leu He Ala Leu Leu Lys Val Lys Arg Asn 
370 375 380 

Asp 
385 

<210> 394 
<211> 305 
<212> PRT 

<213> Escherichia coli 



<400> 394 



Met 


Pro 


Leu 


Leu 


Lys 


Leu Trp Ala Gly Ser Leu Val Met 


Leu 


Ala 


Ala 


1 








5 


10 




15 




Val 


Ser 


Leu 


Pro 


Leu 


Gin Ala Ala Ser Pro Val Lys Val 


Gly 


Ser 


Lys 








20 




25 


30 






He 


Asp 


Thr 


Glu 


Gly 


Ala Leu Leu Gly Asn He He Leu 


Gin 


Val 


Leu 




35 






40 . 45 








Glu 


Ser 


His 


Gly 


Val 


Pro Thr Val Asn Lys Val Gin Leu Gly Thr 


Thr 




50 






55 60 . 








Pro 


Val 


Val 


Arg 


Gly 


Ala lie Thr Ser Gly Glu Leu Asp 


lie 


Tyr 


Pro 


65 






70 75 






80. 


Glu 


Tyr 


Thr 


Gly 


Asn 


Gly Ala Phe Phe Phe Lys Asp Glu Asn Asp Ala 






85 


90 




95 




Ala 


Trp 


Lys 


Asn 


Ala 


Gin Gin Gly Tyr Glu Lys Val Lys 


Lys 


Leu Asp 




100 




105 


110 






Ser 


Glu 


His 


Asn 


Lys 


Leu He Trp Leu Thr Pro Ala Pro 


Ala 


Asn 


Asn 






115 






120 . 125 








Thr 


Trp 


Thr 


He 


Ala 


Val Arg Gin Asp Val Ala Glu Lys 


Asn 


Lys 


Leu 
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130 135 140 

Thr Ser Leu Ala Asp Leu Ser Arg Tyr Leu Gin Glu Gly Gly Thr Phe 
145 ~ 150 155 160 

Lys Leu Ala Ala Ser Ala Glu Phe He Glu Arg Ala Asp Ala Leu Pro 

165 170 175 

Ala Phe Glu Lys Ala Tyr Gly Phe Lys Leu Gly Gin Asp Gin Leu Leu 

180 185 190 

Ser Leu Ala Gly Gly Asp Thr Ala Val Thr He Lys Ala Ala Ala Gin 

195 200 205 

Gin Thr Ser Gly Val Asn Ala Ala Met Ala Tyr Gly Thr Asp Gly Pro 

210 215 220 

Val Ala Ala Leu Gly Leu Gin Thr Leu Ser Asp Pro Gin Gly Val Gin 
225 230 235 240 

Pro He Tyr Ala Pro Ala Pro Val Val Arg Glu Ser Val Leu Arg Glu 

245 250 255 

Tyr Pro Gin Met Ala Gin Trp Leu Gin Pro Val Phe Ala Ser Leu Asp 

260 265 270 

Ala Lys Thr Leu Gin Gin Leu Asn Ala Ser He Ala Val Glu Gly Leu 

275 280 285 

Asp Ala Lys Lys Val Ala Ala Asp Tyr Leu Lys Gin Lys Gly Trp Thr 
290 295 300 

Lys 
305 

<210> 395 
<211> 207 
<212> PRT 

<213> Escherichia coli 
<400> 395 

Met Leu Val Tyr Trp Leu Asp He Val Gly Thr Ala Val Phe Ala lie 

15 10 15 

Ser Gly Val Leu Leu Ala Gly Lys Leu Arg Met Asp Pro Phe Gly Val 

20 25 30 

Leu Val Leu Gly Val Val Thr Ala Val Gly Gly Gly Thr lie Arg Asp 

35 40 45 

Met Ala Leu Asp His Gly Pro Val Phe Trp Val Lys Asp Pro Thr Asp 

50 55 60 

Leu Val Val Ala Met Val Thr Ser Met Leu Thr He Val Leu Val Arg 
65 70 75 80 

Gin Pro Arg Arg Leu Pro Lys Trp Met Leu Pro Val Leu Asp Ala Val 

85 90 95 

Gly Leu Ala Val Phe Val Gly He Gly Val Asn Lys Ala Phe Asn Ala 

100 105 110 

Glu Ala Gly Pro Leu lie Ala Val Cys Met Gly Val He Thr Gly Val 

115 120 125 

Gly Gly Gly He He Arg Asp Val Leu Ala Arg Glu lie Pro Met He 

130 135 140 

Leu Arg Thr Glu lie Tyr Ala Thr Ala Cys lie lie Gly Gly He Val 
145 "* 150 155 160 

His Ala Thr Ala Tyr Tyr Thr Phe Ser Val Pro Leu Glu Thr Ala Ser 

165 170 175 

Met Met Gly Met Val Val Thr Leu Leu lie Arg Leu Ala Ala lie Arg 

180 185 190 

Trp His Leu Lys Leu Pro Thr Phe Ala Leu Asp Glu Asn Gly Arg 
195 200 205 

<210> 396 
<211> 266 
<212> PRT 
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<213> Escherichia coli 



<400> 396 



Met 


Ala 


uy o 


Ser 


Leu 


Phe Arg Ala Leu Val Ala Leu 


Ser 


Phe Leu Ala 


1 






-J 


10 




15 


Pro 


Leu 


Trn 


Leu 


Asn 


Ala Ala Pro Arg Val He Thr 


Leu 


Ser Pro Ala 










25 




30 


Asn 


Thr 




T on 


Al A 


Phe Ala Ala Gly He Thr Pro Val Gly Val Ser 






OO 






40 


45 




Ser Tyr 


Ser 




iyn 


Pro Pro Gin Ala Gin Lys He 


Glu 


Gin Val Ser 




50 








55 60 






Thr 


Trp 


Gin 




L JC L. 


Asn Leu Glu Arg He Val Ala 


Leu 


Lys Pro Asp 


65 






70 75 




80 


Leu 


Val 


lie 


n 1 a 
Hid 


Trp 


Arg Gly Gly Asn Ala Glu Arg Gin Val Asp Gin 










Q C 

oO 


90 




95 


Leu 


Ala 


Cnr 

oer 


Leu 


oxy 


He Lys Val Met Trp Val Asp 


Ala 


Thr Ser He 








1UU 




105 




110 


Glu 


Gin 


Tie 




As n 


Ala Leu Arg Gin Leu Ala Pro 


Trp 


Ser Pro Gin 






1 1 c 

J. 10 






120 


125 




Pro 


Asp 


Lys 


t\± td 


fll ii 


Gin Ala Ala Gin Ser Leu Leu Asp Gin Tyr Ala 




130 






135 140 






Gin 


Leu 




Ala 


Gin 


Tyr Ala Asp Lys Pro Lys Lys Arg 


Val Phe Leu 


145 








150 155 




160 


Gin 


Phe 


Gly 


lie 


Asn 


Pro Pro Phe Thr Ser Gly Lys 


Glu 


Ser He Gin 








165 


170 




175 


Asn 


Gin 


Val 


Leu 


Glu 


Val Cys Gly Gly Glu Asn He 


Phe 


Lys Asp Ser 








180 




185 




190 


Arg 


Val 


Pro 


Trp 


Pro 


Gin Val Ser Arg Glu Gin Val 


Leu Ala Arg Ser 




195 






200 


205 




Pro 


Gin 


Ala 


He 


Val 


He Thr Gly Gly Pro Asp Gin 


He 


Pro Lys He 




210 








215 " 220 






Lys 


Gin 


Tyr 


Trp 


Gly 


Glu Gin Leu Lys He Pro Val 


He 


Pro Leu Thr 


225 










230 235 




240 


Ser Asp 


Trp 


Phe 


Glu 


Arg Ala Ser Pro Arg He He 


Leu 


Ala Ala Gin 








245 


250 




255 


Gin 


Leu 


Cys 


Asn 


Ala 


Leu Ser Gin Val Asp 










260 




265 







<210> 397 
<211> 232 
<212> PRT 

<213> Escherichia coli 



<400> 397 



Met 


Lys 


He 


Gly 


He 


He Gly Ala 


1 








5 




Arg 


Asp 


Lys 


He 


Glu 


Asn Arg Gin 








20 






He 


Tyr 


Thr 


Gly 


Gin 


Leu Asn Gly 






35 






40 


Gly 


He 


Gly 


Lys 


Val 


Ala Ala Ala 




50 








55 


His 


Cys 


Lys 


Pro 


Asp 


Val He He 


65 










70 


Ala 


Pro 


Thr 


Leu 


Lys 


Val Gly Asp 










85 


Tyr 


His 


Asp 


Ala 


Asp 


Val Thr Ala 








100 






Pro 


Gly 


Cys 


Pro 


Ala 


Gly Phe Lys 






115 






120 



Met Glu Glu Glu Val Thr Leu Leu 

10 15 
Thr He Ser Leu Gly Gly Cys Glu 
25 30 
Thr Glu Val Ala Leu Leu Lys Ser 
45 

Leu Gly Ala Thr Leu Leu Leu Glu 
60 

Asn Thr Gly Ser Ala Gly Gly Leu 

75 80 
He Val Val Ser Asp Glu Ala Arg 

90 95 
Phe Gly Tyr Glu Tyr Gly Gin Leu 
105 HO 
Ala Asp Asp Lys Leu He Ala Ala 
125 
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Ala 


Glu 


Ala Cys 


He Ala 


Glu 


Leu 




130 






135 




He 


Val 


Ser Gly 


Asp Ala 


Phe 


He 


145 






150 






He 


Arg 


His Asn 


Phe Pro 


Gin 


Ala 






165 






Ala 


He 


Ala His 


Val Cys 


His 


Asn 






180 








Arg Ala 


He Ser 


Asp Val Ala 


Asp 






195 






200 


Glu 


Phe 


Leu Ala 


Val Ala 


Ala 


Lys 




210 






215 




Leu 


Val 


Gin Lys 


Leu Ala 


His 


Gly 


225 






230 







Asn Leu Asn Ala Val Arg Gly Leu 
140 

Asn Gly Ser Val Gly Leu Ala Lys 
155 160 
He Ala Val Glu Met Glu Ala Thr 

170 175 
Phe Asn Val Pro Phe Val Val Val 
185 190 
Gin Gin Ser His Leu Ser Phe Asp 
205 

Gin Ser Ser Leu Met Val Glu Ser 
220 



<210> 398 
<211> 262 
<212> PRT 

<213> Escherichia coli 



<400> 398 

Met He Asp Lys Ser Ala Phe Val His Pro Thr Ala He Val Glu Glu 

1 5 10 15 

Gly Ala Ser He Gly Ala Asn Ala His He Gly Pro Phe Cys He Val 

20 25 30 

Gly Pro His Val Glu He Gly Glu Gly Thr Val Leu Lys Ser His Val 

35 40 45 

Val Val Asn Gly His Thr Lys He Gly Arg Asp Asn Glu He Tyr Gin 

50 55 60 

Phe Ala Ser He Gly Glu Val Asn Gin Asp Leu Lys Tyr Ala Gly Glu 
65 70 75 80 

Pro Thr Arg Val Glu He Gly Asp Arg Asn Arg He Arg Glu Ser Val 

85 90 95 

Thr He His Arg Gly Thr Val Gin Gly Gly Gly Leu Thr Lys Val Gly 

100 105 110 

Ser Asp Asn Leu Leu Met He Asn Ala His He Ala His Asp Cys Thr 

115 120 125 

Val Gly Asn Arg Cys He Leu Ala Asn Asn Ala Thr Leu Ala Gly His 

130 ^ ~ 135 140 

Val Ser Val Asp Asp Phe Ala He He Gly Gly Met Thr Ala Val His 
145 ^ 150 155 160 

Gin Phe Cys He He Gly Ala His Val Met Val Gly Gly Cys Ser Gly 

165 170 175 

Val Ala Gin Asp Val Pro Pro Tyr Val He Ala Gin Gly Asn His Ala 

180 185 190 

Thr Pro Phe Gly Val Asn He Glu Gly Leu Lys Arg Arg Gly Phe Ser 

195 200 205 

Arg Glu Ala He Thr Ala He Arg Asn Ala Tyr Lys Leu He Tyr Arg 

210 215 220 

Ser Gly Lys Thr Leu Asp Glu Val Lys Pro Glu He Ala Glu Leu Ala 
225 230 235 240 

Glu Thr Tyr Pro Glu Val Lys Ala Phe Thr Asp Phe Phe Ala Arg Ser 

245 * 250 255 

Thr Arg Gly Leu He Arg 
260 



<210> 399 
<211> 382 
<212> PRT 

<213> Escherichia coli 
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<400> 399 

Met Thr Glu Gin Arg Pro Leu Thr He Ala Leu Val Ala Gly Glu Thr 

15 10 15 

Ser Gly Asp He Leu Gly Ala Gly Leu He Arg Ala Leu Lys Glu His 

20 25 30 

Val Pro Asn Ala Arg Phe Val Gly Val Ala Gly Pro Arg Met Gin Ala 

35 40 45 

Glu Gly Cys Glu Ala Trp Tyr Glu Met Glu Glu Leu Ala Val Met Gly 

50 55 60 

He Val Glu Val Leu Gly Arg Leu Arg Arg Leu Leu His He Arg Ala 
65 70 75 80 

Asp Leu Thr Lys Arg Phe Gly Glu Leu Lys Pro Asp Val Phe Val Gly 

85 90 95 

He Asp Ala Pro Asp Phe Asn He Thr Leu Glu Gly Asn Leu Lys Lys 

100 105 110 

Gin Gly He Lys Thr He His Tyr Val Ser Pro Ser Val Trp Ala Trp 

115 120 125 

Arg Gin Lys Arg Val Phe Lys He Gly Arg Ala Thr Asp Leu Val Leu 

130 135 140 

Ala Phe Leu Pro Phe Glu Lys Ala Phe Tyr Asp Lys Tyr Asn Val Pro 
145 150 155 160 

Cys Arg Phe He Gly His Thr Met Ala Asp Ala Met Pro Leu Asp Pro 

165 170 175 

Asp Lys Asn Ala Ala Arg Asp Val Leu Gly He Pro His Asp Ala His 

180 185 190 

Cys Leu Ala Leu Leu Pro Gly Ser Arg Gly Ala Glu Val Glu Met Leu 

195 200 205 

Ser Ala Asp Phe Leu Lys Thr Ala Gin Leu Leu Arg Gin Thr Tyr Pro 

210 215 220 

Asp Leu Glu He Val Val Pro Leu Val Asn Ala Lys Arg Arg Glu Gin 
225 230 235 240 

Phe Glu Arg He Lys Ala Glu Val Ala Pro Asp Leu Ser Val His Leu 

245 250 255 

Leu Asp Gly Met Gly Arg Glu Ala Met Val Ala Ser Asp Ala Ala Leu 

260 265 270 

Leu Ala Ser Gly Thr Ala Ala Leu Glu Cys Met Leu Ala Lys Cys Pro 

275 280 285 

Met Val Val Gly Tyr Arg Met Lys Pro Phe Thr Phe Trp Leu Ala Lys 

290 295 300 

Arg Leu Val Lys Thr Asp Tyr Val Ser Leu Pro Asn Leu Leu Ala Gly 
305 310 315 320 

Arg Glu Leu Val Lys Glu Leu Leu Gin Glu Glu Cys Glu Pro Gin Lys 

325 330 335 

Leu Ala Ala Ala Leu Leu Pro Leu Leu Ala Asn Gly Lys Thr Ser His 

340 345 350 

Ala Met His Asp Thr Phe Arg Glu Leu His Gin Gin He Arg Cys Asn 

355 * 360 365 

Ala Asp Glu Gin Ala Ala Gin Ala Val Leu Glu Leu Ala Gin 
370 375 380 

<210> 400 
<211> 198 
<212> PRT 

<213> Escherichia coli 
<400> 400 

Met He Glu Phe Val Tyr Pro His Thr Gin Leu Val Ala Gly Val Asp 

1 5 10 15 

Glu Val Gly Arg Gly Pro Leu Val Gly Ala Val Val Thr Ala Ala Val 
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20 










25 










30 






He 


Leu 


Asp 


Pro 


Ala 


Arg 


Pro 


He 


Ala 


Gly 


Leu 


Asn 


Asp 


Ser 


Lys 


Lys 






35 










40 










45 








Leu 


Ser 


Glu 


Lys 


Arg 


Arg 


Leu 


Ala 


Leu 


Tyr 


Glu 


Glu 


He 


Lys 


Glu 


Lys 




50 










55 










60 










Ala 


Leu 


Ser 


Trp 


Ser 


Leu 


Gly 


Arg 


Ala 


Glu 


Pro 


His 


Glu 


He 


Asp 


Glu 


65 










70 










75 










80 


Leu 


Asn 


He 


Leu 


His 


Ala 


Thr 


Met 


Leu 


Ala 


Met 


Gin 


Arg 


Ala 


Val 


Ala 










85 










90 










95 




Gly 


Leu 


His 


He 


Ala 


Pro 


Glu 


Tyr 


Val 


Leu 


He 


Asp 


Gly 


Asn 


Arg 


Cys 






100 










105 










110 






Pro 


Lys 


Leu 


Pro 


Met 


Pro 


Ala 


Met 


Ala 


Val 


Val 


Lys 


Gly 


Asp 


Ser 


Arg 




115 










120 










125 








Val 


Pro 


Glu 


He 


Ser 


Ala 


Ala 


Ser 


He 


Leu 


Ala 


Lys 


Val 


Thr 


Arg 


Asp 




130 










135 










140 










Ala 


Glu 


Met 


Ala 


Ala 


Leu 


Asp 


He 


Val 


Phe 


Pro 


Gin 


Tvr 


Gly 


Phe 


Ala 


145 










150 










155 










160 


Gin 


His 


Lys 


Gly 


Tyr 


Pro 


Thr 


Ala 


Phe 


His 


Leu 


Glu 


Lys 


Leu 


Ala 


Glu 










165 










170 










175 




His 


Gly 


Ala 


Thr 


Glu 


His 


His 


Arg 


Arg 


Ser 


Phe 


Gly 


Pro 


Val 


Lys 


Arg 






180 










185 










190 






Ala 


Leu 


Gly 


Leu 


Ala 


Ser 


























195 




























<210> 401 




























<211> 1160 




























<212> PRT 




























<213> Escherichia coli 






















<400> 401 




























Met 


Ser 


Glu 


Pro 


Arg 


Phe 


Val 


His 


Leu 


Arg 


Val 


His 


Ser 


Asp 


Tyr 


Ser 


1 








5 










10 










15 




Met 


He 


Asp 


Gly 


Leu 


Ala 


Lys 


Thr 


Ala 


Pro 


Leu 


Val 


Lys 


Lys 


Ala 


Ala 








20 










25 










30 






Ala 


Leu 


Gly 


Met 


Pro 


Ala 


Leu 


Ala 


He 


Thr 


Asp 


Phe 


Thr 


Asn 


Leu 


Cys 






35 










40 










45 








Gly 


Leu 


Val 


Lys 


Phe 


Tyr 


Gly 


Ala 


Gly 


His 


Gly 


Ala 


Gly 


He 


Lys 


Pro 




50 










55 










60 










He 


Val 


Gly 


Ala 


Asp 


Phe 


Asn 


Val 


Gin 


Cys 


Asp 


Leu 


Leu 


Gly 


Asp 


Glu 


65 










70 










75 










80 


Leu 


Thr 


His 


Leu 


Thr 


Val 


Leu 


Ala 


Ala 


Asn 


Asn 


Thr 


Gly 


Tyr 


Gin 


Asn 










85 










90 










95 




Leu 


Thr 


Leu 


Leu 


He 


Ser 


Lys 


Ala 


Tyr 


Gin 


Arg 


Gly 


Tyr 


Gly 


Ala 


Ala 








100 










105 










110 






Gly 


Pro 


He 


He 


Asp 


Arg 


Asp 


Trp 


Leu 


He 


Glu 


Leu 


Asn 


Glu 


Gly 


Leu 






115 










120 










125 








He 


Leu 


Leu 


Ser 


Gly 


Gly 


Arg 


Met 


Gly 


Asp 


Val 


Gly 


Arg 


Ser 


Leu 


Leu 




130 










135 










140 










Arg 


Gly 


Asn 


Ser 


Ala 


Leu 


Val 


Asp 


Glu 


Cys 


Val 


Ala 


Phe 


Tyr 


Glu 


Glu 


145 










150 










155 










160 


His 


Phe 


Pro 


Asp 


Arg 


Tyr 


Phe 


Leu 


Glu 


Leu 


He 


Arg 


Thr 


Gly 


Arg 


Pro 










165 










170 










175 




Asp 


Glu 


Glu 


Ser 


Tyr 


Leu 


His 


Ala 


Ala 


Val 


Glu 


Leu 


Ala 


Glu 


Ala 


Arg 






180 










185 










190 






Gly 


Leu 


Pro 


Val 


Val 


Ala 


Thr 


Asn 


Asp 


Val 


Arg 


Phe 


He 


Asp 


Ser 


Ser 




195 










200 










205 








Asp 


Phe 


Asp 


Ala 


His 


Glu 


He 


Arg 


Val 


Ala 


He 


His 


Asp 


Gly 


Phe 


Thr 




210 










215 










220 










Leu 


Asp 


Asp 


Pro 


Lys 


Arg 


Pro 


Arg 


Asn 


Tyr 


Ser 


Pro 


Gin 


Gin 


Tyr 


Met 



225 230 235 240 
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oiu 


VjIU 






vyo ox li 


Leu Phe Ala Asp 


He 


Pro 


Glu Ala 












250 








255 


Lieu Ala 


Asn 


J. nx 


val 


f^l 11 


Tip Ala 
lie nia 


Lys Arg Cys Asn 


Val 


Thr 


Val Arg 














265 






270 




Leu Gly 


OlU 


Tyr 


trne 


Leu 


pro oin 


Phe Pro Thr Gly 


Acn 


Met 


Ser Thr 


275 






280 






285 






Glu Asp 


Tyr 


Leu 


Vol 


Lys 


A ret A 1 a 
rVX y nXa 


Lys Glu Gly Leu 


Glu 


Glu 


Arn Leu 


290 








295 




300 








Ala D V> a 

Ala rne 


Leu 


rne 


Pro 


Hon 


nln Gin 

OX U UX U 


Glu Arg Leu 


Lys 


Arg 


Arg 


Pro Glu 










-> X w 




315 








320 


lyr Asp 


pin 

bill 


Arg 


Leu 


Gin 


1 Hi. oxu 


Leu Gin Val 


He 


Asn 


Gin 


Met Gly 












330 








335 


Phe Pro 


oiy 


Tyr 


rne 


T an 
lifer U 


Tip Va 1 
lie v ax 


Met Glu Phe 


lie 


Gin 


Tto 


Ser Lvs 












345 






350 




Asp Asn 


r*l .> 

Gly 


vai 


riO 


vai 


oiy rxO 


Gly Arg Gly Ser 


ox y 


Ala 


Gly Ser 


ODD 








360 






365 






Leu va± 


TV I 

Ala 


Tyr 


Ala 


Leu 


T uc Tie* 
liyo lie 


Thr Asp Leu Asp 


Pro 


Leu 


Glu Phe 


370 










■J i *j 




380 








Asp Leu 


Leu 


fne 


olU 


Arg 


true lieu. 


Asn Pro Glu Arg 


Val 


Ser 


Met Pro 


JO J 








390 




395 








400 


Asp Phe 


Asp 


Vol 


Asp 


php 

IT J le 


Cys Met 


Glu Lys Arg Asp 


Gin 


Val 


He Glu 








*i U U 






410 








415 


HIS vai 


Aid 


Asp 




iyr 


p ~\ w B rn 
ox y y 


Asp Ala Val 


Ser 


Gin 


He 


He Thr 














425 






430 




rne triy 


i nr 


Met 


Ala 


Al a 
MX d 


T \/c Ala 


Val lie Arg Asp 


Val 


Glv 

oxy 


Arg Val 


T J J 








440 






445 






Leu Gly 


nlS 


Prn 


Tyr 


ox y 


Php Val 
rue vax 


Asp Arg lie 


Ser 


Lys 


Leu 


He Pro 


450 










455 




460 








rxO ASp 


Pro 




He L. 


Thr 


T.cmi Al a 


Lys Ala Phe Glu 


Ala 


Glu 


Pro Gin 


HOJ 








470 




475 








480 


Leu Pro 


IJl u 


Tip 
1 le 


i yr 


Glu 


Ala Asd 


Glu Glu Val 


Lys 


Ala 


Leu 


He Asp 








485 






490 








4 95 


Met Ala 


Arg 


Lys 


Leu 




Gly Val 


Thr Arg Asn Ala 


Glv 

oxy 


Lys 


His Ala 






JUU 








505 






510 




biy oiy 


vai 


Va 1 
Vol 


Tip 

x Xtr 


Al a 


Prn Thi* 
nu x iix 


Lys lie Thr Asp 


Phe 


Ala 


Pro Leu 










520 






525 






Tyr Cys 


Asp 




ulU 


oiy 


Lys His 


Pro Val Thr 


Gin 


Phe 


Asp 


Lv^ Ser 












535 




540 








Asp vai 


oiu 


Tyr 


J_a 


ox y 


T.mi Val 
lieu vox 


Lys Phe Asp 


Phe 


Leu 


Glv 
oxy 


Leu Aro 










U *J u 




555 








560 


Thr Leu 


i nr 


lie 


i ie 


Asn 


Trn Hi a 
l xp nia 


Leu Glu Met 


He 


Asn 


Lys 


Arn Am 








JD J 






570 








575 


Ala Lys 


Asn 


oiy 


ulU 


rr x w 


Pro Leu 


Asp lie Ala 


Ala 


He 


Pro 


Leu Asd 






JOU 








585 






590 




Asp Liys 


Lys 


Ser 


tr Iltr 


7\ «ri 
Mop 


Mpf T.pii 
L J tr U Jjc U 


Gin Arg Ser 


Glu 


Thr 


Thr 


Ala Val 












600 






605 






rile vjIU 


T on 


OX LI 


Cqv- 
OCX 


"X y 


Gly Met 


Lys Asp Leu 


He 


Lys 


Arg 


Leu Gin 


610 










615 




620 








Orrt Son 
rxU rlSkv 


Pt/Q 


it I xtr 


Glu 


Asp 


Met lie 


Ala Leu Val 


Ala 


Leu 


Phe 


Arg Pro 










630 




635 








640 


oiy XTiO 


Leu 


bill 


C o V 


riii 
uiy 


Mpf Val 
L JC l. vax 


Asp Asn Phe 


He 


Asp 


Arg 


Lys His 






645 






650 








655 






Glu 


lie 


Ser 


Tyr Pro 


Asp Val Gin 


Trp 


Gin 


His 


Glu Ser 




660 








665 






670 




Leu Lys 


Pro 


Val 


Leu 


Glu 


Pro Thr 


Tyr Gly lie 


He 


Leu 


Tyr 


Gin Glu 


675 








680 






685 






Gin Val 


Met 


Gin 


lie 


Ala 


Gin Val 


Leu Ser Gly Tyr 


Thr 


Leu 


Gly Gly 


690 










695 




700 








Ala Asp 


Met 


Leu 


Arg 


Arg 


Ala Met 


Gly Lys Lys 


Lys 


Pro 


Glu 


Glu Met 


705 








710 




715 








720 
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Ala 


Lys 


Gin 


Arg 


Ser 


Val 


Phe 


Ala 


Glu 


Gly 


Ala 


Glu 


Lys 


Asn 


Gly 


lie 








725 










730 










735 




Asn 


Ala 


Glu 


Leu 
740 


Ala 


Met 


Lys 


He 


Phe 
745 


Asp 


Leu 


Val 


Glu 


Lys 
750 


Phe 


Ala 


Gly Tyr 


Gly 


Phe 


Asn 


Lys 


Ser 


His 


Ser 


Ala 


Ala 


Tyr 


Ala 


Leu 


Val 


Ser 






755 










760 










765 








Xyr 


Gin 


Thr 


Leu 


Trp Leu 


Lys 


Ala 


His 


Tyr 


Pro 


Ala 


Glu 


Phe 


Met 


Ala 


770 










775 










780 










Ala 


Val 


Met 


Thr 


Ala Asp 


Met 


Asp 


Asn 


Thr 


Glu 


Lys 


Val 


Val 


Gly 


Leu 


785 










790 










795 










800 


Val 


Asp 


Glu 


Cys 


Trp Arg 


Met 


Gly 


Leu 


Lys 


He 


Leu 


Pro 


Pro 


Asp 


lie 








805 










810 










815 




Asn 


Ser 


Gly 


Leu 


Tyr 


His 


Phe 


His 


Val 


Asn 


Asp 


Asp 


Gly 


Glu 


He 


Val 






820 










825 










830 






Tyr Gly 


He 


Gly 


Ala 


He 


Lys 


Gly 


Val 


Gly 


Glu 


Gly 


Pro 


He 


Glu 


Ala 






835 










840 










845 








He 


lie 


Glu 


Ala 


Arg Asn 


Lys 


Gly 


Gly 


Tyr 


Phe 


Arg 


Glu 


Leu 


Phe 


Asp 




850 










855 










860 










Leu 


Cys 


Ala 


Arg 


Thr Asp 


Thr 


Lys 


Lys 


Leu 


Asn 


Arg 


Arg 


Val 


Leu 


Glu 


865 








870 










875 










880 


Lys 


Leu 


He 


Met 


Ser Gly 


Ala 


Phe 


Asp 


Arg 


Leu 


Gly 


Pro 


His 


Arg 


Ala 








885 










890 










895 




Ala 


Leu 


Met 


Asn 
900 


Ser 


Leu 


Gly 


Asp 


Ala 
905 


Leu 


Lys 


Ala 


Ala 


Asp 
910 


Gin 


His 


Ala 


Lys 


Ala 


Glu 


Ala 


He 


Gly 


Gin 


Ala 


Asp 


Met 


Phe 


Gly 


Val 


Leu 


Ala 




915 










920 










925 








Glu 


Glu 
930 


Pro 


Glu 


Gin 


He 


Glu 
935 


Gin 


Ser 


Tyr 


Ala 


Ser 
940 


Cys 


Gin 


Pro 


Trp 


Pro 


Glu 


Gin 


Val 


Val 


Leu 


Asp 


Gly 


Glu 


Arg 


Glu 


Thr 


Leu 


Gly 


Leu 


Tyr 


945 










950 










955 










960 


Leu 


Thr 


Gly 


His 


Pro 


lie 


Asn 


Gin 


Tyr 


Leu 


Lys 


Glu 


lie 


Glu 


Arg 


Tyr 








965 










970 










975 




Val 


Gly 


Gly 


Val 
980 


Arg 


Leu 


Lys 


Asp 


Met 
985 


His 


Pro 


Thr 


Glu 


Arg 
990 


Gly 


Lys 


Val 


He 


Thr 


Ala 


Ala 


Gly 


Leu 


Val 


Val 


Ala 


Ala 


Arg 


Val 


Met 


Val 


Thr 






995 










1000 








1005 






Lys 


Arg 


Gly 


Asn 


Arg 


He 


Gly 


He Cys 


Thr 


Leu 


Asp 


Asp Arg 


Ser 


Gly 


1010 








1015 








1020 








Arg Leu Glu 


Val 


Met 


Leu 


Phe 


Thr Asp 


Ala 


Leu 


Asp Lys Tyr 


Gin 


Gin 


1025 








1030 








1035 








1040 


Leu 


Leu 


Glu 


Lys 


Asp 


Arg 


He 


Leu 


He 


Val 


Ser Gly Gin Val 


Ser 


Phe 



1045 1050 1055 



Asp Asp Phe Ser Gly Gly Leu Lys Met Thr Ala Arg Glu Val Met Asp 

1060 1065 1070 

lie Asp Glu Ala Arg Glu Lys Tyr Ala Arg Gly Leu Ala lie Ser Leu 

1075 1080 1085 

Thr Asp Arg Gin He Asp Asp Gin Leu Leu Asn Arg Leu Arg Gin Ser 

1090 1095 1100 

Leu Glu Pro His Arg Ser Gly Thr lie Pro Val His Leu Tyr Tyr Gin 
1105 1110 1115 1120 

Arg Ala Asp Ala Arg Ala Arg Leu Arg Phe Gly Ala Thr Trp Arg Val 

1125 1130 1135 

Ser Pro Ser Asp Arg Leu Leu Asn Asp Leu Arg Gly Leu lie Gly Ser 

1140 1145 1150 

Glu Gin Val Glu Leu Glu Phe Asp 
1155 1160 

<210> 402 
<211> 239 
<212> PRT 
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<213> Escherichia coli 



<400> 402 



Met Asn 


val 


Don 


Phe 


Phe 


vox 


Thr 


Cys 


He 


Glv 
vjxy 


Asp 


Ala 


Leu 


Lys 


Ser 


l 






5 










i ft 

x v 










15 




n rrx Mat- 

rtxy wet. 


Al a 


7\ TCI 

/vrg 


Asp 


Ser 


Val 


Leu 


Leu 


Leu 


Glu 


Lys 


Leu 


Glv 


Cys 


nx y 
















25 










30 






vax Asn 


rne 


Pro 


Glu 


Lys 


Gl n 


Gly 


Cys 


Cys 


Glv 
\axy 


Gin 


Pro 


Ala 


He 


Asn 
























45 








Ser Gly 


Tyr 


Tip 
116 


Lys 


Glu 


Ala 


He 


Pro 


Glv 


Met 


Lys 


Asn 


Leu 


He 


Ala 












5 5 










60 










Ala Leu 


ri n 
ulu 


Asp 


Asn Asp 


Asp 


Pro 


He 


He 


Ser 


Pro 


Ala 


Glv 


Ser 


Cys 


Oj 








70 










75 










80 


mr Ayr 


Ala 


Vol 


Lys 


Ser 


i yx 


Pro 


Thr 


T vr 
lyx 


Leu 


Ala 


Asp 


Glu 


Pro 


Glu 






85 










gn 










95 




Trp Ala 


Ser 


Arg 


Ala 


Ala 


Lys 


Val 


Ala 


Ala 


Arg 


Met 


Gin 


Asp 


Leu 


Thr 






1UU 










105 

1 u J 










110 






Ser Phe 


tIq 
1 le 


Vdl 


Asn 


Lys 




Gly 


Val 


Val 


Asp 


Val 


Gly 


Ala 


Ser 


Leu 




11 J 










120 










125 










Arg 


rtxa 


Val 


Tyr 


His 


Pro 


Ser 


Cys 


Ser 


Leu 


Ala 


Arg 


Lys 


Leu 


1 7ft 




















140 










'jj.y vciJ. 


Lys 


Asp 


Glu 


Pro 


Leu 


Thr 


Leu 


Leu 


Lys 


Asn 


Val 


Arg 


Glv 


Leu 


145 








150 










155 










160 


Glu Leu 


Leu 


Thr 


Phe 


Ala 


Glu 


Gin 


Asp 


Thr 


Cys 


Cys 


Gly 


Phe 


Gly 


Gly 








165 










170 










175 




Thr Phe 


Ser 


vcti 


Lys 


Met 


Ala 


Glu 


He 


Ser 


Glv 


Glu 


Met 


Val 


Lys 


Glu 






180 










185 










190 






Lys Val 


Ala 


His 


Leu 


Met 


Glu 


Val 


Arg 


Pro 


Glu 


Tyr 


Leu 


He 


Gly 


Ala 


195 










200 










205 








Asp Val 


Ser 


Cys 


Leu 


Leu 


Asn 


He 


Ser 


Gly 


Arg 


Leu 


Gin 


Arg 


Glu 


Glv 

uxy 


210 










215 










220 










Gin Lys 


Val 


Lys 


Val 


Met 


His 


He 


Ala 


Glu 


Val 


Leu 


Met 


Ser 


Arg 




225 








230 










235 












<210> 403 




























<211> 475 




























<212> PRT 




























<213> Escherichia coli 






















<400> 403 




























Met Ser 


He 


Lys 


Thr 


Ser 


Asn 


Thr 


Asp 


Phe 


Lys 


Thr 


Arg 


He 


Arg 


Gin 


1 






5 










10 










15 




Gin He 


Glu 


Asp 


Pro 


He 


Met 


Arg 


Lys 


Ala 


Val 


Ala 


Asn 


Ala 


Gin 


Gin 






20 










25 










30 






Arg He 


Gly 


Ala 


Asn 


Arg 


Gin 


Lys 


Met 


Val 


Asp 


Glu 


Leu 


Glv 


His 


TrD 




35 










40 










45 








Glu Glu 


Trp 


Arg 


Asp 


Arg 


Ala 


Ala 


Gin 


He 


Arg 


Aso 


His 


Val 


Leu 


Ser 


50 










55 










60 










Asn Leu 


Asp 


Ala 


Tyr 


Leu 


±yx 


Gin 


Leu 


Ser 


Glu 


Lys 


Val 


Thr 


Gin 


Asn 


65 








70 










75 










80 


Gly Gly 


His 


Val 


Tyr 


Phe 


Ala 


Arg 


Thr 


Lys 


Glu 


Asp 


Ala 


Thr 


Arg 


Tyr 








85 










90 










95 




He Leu 


Gin 


Val 


Ala 


Gin 


Arg 


Lys 


Asn 


Ala 


Arg 


Lys 


Val 


Val 


Lys 


Ser 






100 










105 










110 






Lys Ser 


Met 


Val 


Thr 


Glu 


Glu 


He 


Gly 


Val 


Asn 


His 


Val 


Leu 


Gin 


Asp 


115 










120 










125 








Ala Gly 


He 


Gin 


Val 


He 


Glu 


Thr 


Asp 


Leu 


Gly 


Glu 


Tyr 


He 


Leu 


Gin 


130 










135 










140 










Leu Asp 


Gin 


Asp 


Pro 


Pro 


Ser 


His 


Val 


Val 


Val 


Pro 


Ala 


He 


His 


Lys 


145 








150 










155 










160 
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Asp Arg 


His 


Gin 


He Arg 


Arg 


Val 


Leu 


His 


Glu 


Arg 


Leu 


Gly 


Tyr 


Glu 








165 








170 










175 




Gly Pro 


Glu 


Thr 


Pro Glu 


Ala 


Met 


Thr 


Leu 


Phe 


He 


Arg 


Gin 


Lys 


He 






180 








185 










190 






Arg Glu 


Asp 


Phe 


Leu Ser 


Ala 


Glu 


He 


Gly 


He 


Thr 


Gly 


Cys 


Asn 


Phe 




195 








200 










205 








Ala Val 


Ala 


Glu 


Thr Gly 


Ser 


Val 


Cys 


Leu 


Val 


Thr 


Asn 


Glu 


Gly 


Asn 


210 








215 










220 










Ala Arg 


Met Cys 


Thr Thr 


Leu 


Pro 


Lys 


Thr 


His 


He 


Ala 


Val 


Met 


Gly 


225 






230 










235 










240 


Met Glu 


Arg 


He 


Ala Pro 


Thr 


Phe 


Ala 


Glu 


Val 


Asp 


Val 


Leu 


He 


Thr 






245 








250 










255 




Met Leu 


Ala 


Arg 


Ser Ala 


Val 


Gly 


Ala 


Arg 


Leu 


Thr 


Gly 


Tyr 


Asn 


Thr 






260 








265 










270 






Trp Leu 


Thr Gly 


Pro Arg 


Glu 


Ala 


Gly 


His 


Val 


Asp 


Gly 


Pro 


Glu 


Glu 




275 








280 










285 








Phe His 


Leu 


Val 


He Val 


Asp 


Asn 


Gly 


Arg 


Ser 


Glu 


Val 


Leu 


Ala 


Ser 


290 








295 










300 










Glu Phe 


Arg 


Asp 


Val Leu 


Arg 


Cys 


He 


Arg 


Cys 


Gly 


Ala 


Cys 


Met 


Asn 


305 






310 










315 










320 


Thr Cys 


Pro 


Ala 


Tyr Arg 


His 


He 


Gly 


Gly 


His 


Gly 


Tyr 


Gly 


Ser 


He 








325 








330 










335 




Tyr Pro 


Gly 


Pro 


He Gly 


Ala 


Val 


He 


Ser 


Pro 


Leu 


Leu 


Gly 


Gly 


Tyr 




340 








345 










350 






Lys Asp 


Phe 


Lys 


Asp Leu 


Pro 


Tyr 


Ala 


Cys 


Ser 


Leu 


Cys 


Thr 


Ala 


Cys 




355 








360 










365 








Asp Asn 


Val 


Cys 


Pro Val 


Arg 


He 


Pro 


Leu 


Ser 


Lys 


Leu 


He 


Leu 


Arg 


370 








375 










380 










His Arg 


Arg Val 


Met Ala 


Glu 


Lys 


Gly 


He 


Thr 


Ala 


Lys 


Ala 


Glu 


Gin 


385 






o q n 
390 






















Arg Ala 


He 


Lys 


Met Phe 


Ala 


Tyr 


Ala 


Asn 


Ser 


His 


Pro 


Gly 


Leu 


Trp 






405 








410 










415 




Lys Val 


Gly Met 


Met Ala 


Gly 


Ala 


His 


Ala 


Ala 


Ser 


Trp 


Phe 


He 


Asn 






420 








425 










430 






Gly Gly 


Lys 


Thr 


Pro Leu 


Lys 


Phe 


Gly 


Ala 


He 


Ser 


Asp 


Trp 


Met 


Glu 




435 








440 










445 








Ala Arg 


Asp Leu 


Pro Glu 


Ala 


Asp 


Gly 


Glu 


Ser 


Phe 


Arg 


Ser 


Trp 


Phe 



450 455 460 



Lys Lys His Gin Ala Gin Glu Lys Lys Asn Gly 
465 470 475 

<210> 404 
<211> 282 
<212> PRT 

<213> Escherichia coli 



<400> 404 



Met 


Leu 


Met 


Arg 


Gin 


Ala 


Gly 


Leu 


Ser 


Met 


Ala 


Ala 


Lys 


His 


His 


Ser 


1 








5 










10 










15 




Asn 


Leu 


Ala 


Arg 


Leu 


Ala 


Thr 


Gly 


Trp 


Lys 


His 


Ala 


He 


Phe 


Leu 


Lys 








20 










25 










30 






Leu 


Thr 


Glu 


Arg 


Val 


Ser 


Val 


Val 


Gly 


Leu 


Arg 


Asn 


He 


Arg 


Arg 


Arg 






35 










40 










45 








Arg 


Lys 


Arg 


Met 


Asp 


Asn 


Arg 


Gly 


Glu 


Phe 


Leu 


Asn 


Asn 


Val 


Ala 


Gin 




50 










55 










60 










Ala 


Leu 


Gly 


Arg 


Pro 


Leu 


Arg 


Leu 


Glu 


Pro 


Gin 


Ala 


Glu 


Asp 


Ala 


Pro 


65 










70 










75 










80 


Leu 


Asn 


Asn 


Tyr 


Ala 


Asn 


Glu 


Arg 


Leu 


Thr 


Gin 


Leu 


Asn 


Gin 


Gin 


Gin 










85 










90 










95 




Arg 


Cys 


Asp 


Ala 


Phe 


He 


Gin 


Phe 


Ala 


Ser 


Asp 


Val 


Met 


Leu 


Thr 


Arg 
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100 105 110 

Cys Glu Leu Thr Ser Glu Ala Lys Ala Ala Glu Ala Ala lie Arg Leu 

115 120 125 

Cys Lys Glu Leu Gly Asp Gin Ser Val Val He Ser Gly Asp Thr Arg 

130 135 140 

Leu Glu Glu Leu Gly He Ser Glu Arg Leu Gin Gin Glu Cys Asn Ala 
145 150 155 160 

Val Val Trp Asp Pro Ala Lys Gly Ala Glu Asn lie Ser Gin Ala Glu 

165 170 175 

Gin Ala Lys Val Gly Val Val Tyr Ala Glu Tyr Gly Leu Thr Glu Ser 

180 185 190 

Gly Gly Val Val Leu Phe Ser Ala Ala Glu Arg Gly Arg Ser Leu Ser 

195 200 205 

Leu Leu Pro Glu Tyr Ser Leu Phe He Leu Arg Lys Ser Thr He Leu 

210 215 220 

Pro Arg Val Ala Gin Leu Ala Glu Lys Leu His Gin Lys Ala Gin Ala 
225 230 235 240 

Gly Glu Arg Met Pro Ser Cys He Asn He He Ser Gly Pro Ser Ser 

245 250 255 

Thr Ala Asp He Glu Leu He Lys Val Val Gly Val His Gly Pro Val 

260 265 270 

Lys Ala Val Tyr Leu He He Glu Asp Cys 
275 280 

<210> 405 

<211> 390 

<212> PRT 

<213> Escherichia coli 



<400> 405 



Met 


Ser 


Glu 


He 


Ala 


Met His Val Thr Ala Lys Pro Ser Ser 


Phe 


Gin 


1 








5 


10 


15 




Cys 


Asn 


Leu 


Lys 


Cys 


Asp Tyr Cys Phe Tyr Leu Glu Lys Glu Ser 


Gin 






20 




25 30 






Phe 


Thr 


His 
35 


Glu 


Lys 


Trp Met Asp Asp Ser Thr Leu Lys Glu 
40 45 


Phe 


He 


Lys 


Gin 


Tyr 


He 


Ala 


Ala Ser Gly Asn Gin Val Tyr Phe Thr 


Trp 


Gin 


50 








55 60 






Gly 


Gly 


Glu 


Pro 


Thr 


Leu Ala Gly Leu Asp Phe Phe Arg Lys 


Val 


He 


65 








70 75 




80 


His 


Tyr 


Gin 


Gin 


Arg 


Tyr Ala Gly Gin Lys Arg He Phe Asn Ala 


Leu 








85 


90 


95 




Gin 


Thr 


Asn 


Gly 
100 


He 


Leu Leu Asn Asn Glu Trp Cys Ala Phe 
105 HO 


Leu 


Lys 


Glu 


His 


Glu 
115 


Phe 


Leu 


Val Gly He Ser He Asp Gly Pro Gin 
120 125 


Glu 


Leu 


His 


Asp 
130 


Arg 


Tyr 


Arg 


Arg Ser Asn Ser Gly Asn Gly Thr Phe 
135 140 


Ala 


Lys 


Val 


lie 


Ala 


Ala 


He 


Glu Arg Leu Lys Ser Tyr Gin Val Glu 


Phe 


Asn 


145 










150 155 




160 


Thr 


Leu 


Thr 


Val 


He 
165 


Asn Asn Val Asn Val His Tyr Pro Leu 
170 


Glu 
175 


Val 


Tyr 


His 


Phe 


Leu 


Lys 


Ser lie Gly Ser Lys His Met Gin Phe 


He 


Glu 






180 




185 190 






Leu 


Leu 


Glu 


Thr 


Gly 


Thr Pro Asn He Asp Phe Ser Gly His 


Ser 


Glu 






195 




200 205 






Asn 


Thr 


Phe 


Arg 


He 


He Asp Phe Ser Val Pro Pro Thr Ala Tyr Gly 




210 








215 220 






Lys 


Phe 


Met 


Ser 


Thr 


He Phe Met Gin Trp Val Lys Asn Asp Val Gly 



225 230 235 240 
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Glu lie Phe lie Arg Gin Phe Glu Ser Phe Val Ser Arg Phe Leu Gly 

245 250 255 

Asn Gly His Thr Ser Cys lie Phe Gin Glu Ser Cys Lys Asp Asn Leu 

260 265 270 

Val Val Glu Ser Asn Gly Asp He Tyr Glu Cys Asp His Phe Val Tyr 

275 280 285 

Pro Gin Tyr Lys He Gly Asn He Asn Lys Ser Glu Leu Lys Thr Met 

290 295 300 

Asn Ser Val Gin Leu Thr Ala Gin Lys Lys Arg He Pro Ala Lys Cys 
305 310 315 320 

Gin Gin Cys Ala Tyr Lys Pro He Cys Asn Gly Gly Cys Pro Lys His 

325 330 335 

Arg He Thr Lys Val Asn Asn Glu Thr Val Ser Tyr Phe Cys Glu Gly 

340 345 350 

Tyr Lys He Leu Phe Ser Thr Met Val Pro Tyr Met Asn Ala Met Val 

355 360 365 

Glu Leu Ala Lys Asn Arg Val Pro Leu Tyr His He Met Asp Val Ala 

370 375 380 

Lys Gin Met Glu Asn Asn 
385 390 

<210> 406 
<211> 571 
<212> PRT 

<213> Escherichia coli 
<400> 406 

Met Glu Leu Ser Thr He Arg Arg Gly Thr Phe Met Lys Ser Ala Leu 

15 10 15 

Lys Lys Ser Val Val Ser Thr Ser He Ser Leu He Leu Ala Ser Gly 

20 25 30 

Met Ala Ala Phe Ala Ala His Ala Ala Asp Asp Val Lys Leu Lys Ala 

35 40 45 

Thr Lys Thr Asn Val Ala Phe Ser Asp Phe Thr Pro Thr Glu Tyr Ser 

50 55 60 

Thr Lys Gly Lys Pro Asn He He Val Leu Thr Met Asp Asp Leu Gly 
65 ~ 70 75 80 

Tyr Gly Gin Leu Pro Phe Asp Lys Gly Ser Phe Asp Pro Lys Thr Met 

85 90 95 

Glu Asn Arg Glu Val Val Asp Thr Tyr Lys He Gly He Asp Lys Ala 

100 105 HO 

He Glu Ala Ala Gin Lys Ser Thr Pro Thr Leu Leu Ser Leu Met Asp 

115 120 125 

Glu Gly Val Arg Phe Thr Asn Gly Tyr Val Ala His Gly Val Ser Gly 

130 135 140 

Pro Ser Arg Ala Ala He Met Thr Gly Arg Ala Pro Ala Arg Phe Gly 
145 150 155 160 

Val Tyr Ser Asn Thr Asp Ala Gin Asp Gly He Pro Leu Thr Glu Thr 

165 170 175 

Phe Leu Pro Glu Leu Phe Gin Asn His Gly Tyr Tyr Thr Ala Ala Val 

180 185 190 

Gly Lys Trp His Leu Ser Lys lie Ser Asn Val Pro Val Pro Glu Asp 

195 200 205 

Lys Gin Thr Arg Asp Tyr His Asp Asn Phe Thr Thr Phe Ser Ala Glu 

210 215 220 

Glu Trp Gin Pro Gin Asn Arg Gly Phe Asp Tyr Phe Met Gly Phe His 
225 230 235 240 

Ala Ala Gly Thr Ala Tyr Tyr Asn Ser Pro Ser Leu Phe Lys Asn Arg 

245 250 255 

Glu Arg Val Pro Ala Lys Gly Tyr lie Ser Asp Gin Leu Thr Asp Glu 
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260 265 270 

Ala He Gly Val Val Asp Arg Ala Lys Thr Leu Asp Gin Pro Phe Met 

275 280 285 

Leu Tyr Leu Ala Tyr Asn Ala Pro His Leu Pro Asn Asp Asn Pro Ala 

290 295 300 

Pro Asp Gin Tyr Gin Lys Gin Phe Asn Thr Gly Ser Gin Thr Ala Asp 
305 310 315 320 

Asn Tyr Tyr Ala Ser Val Tyr Ser Val Asp Gin Gly Val Lys Arg He 

325 330 335 

Leu Glu Gin Leu Lys Lys Asn Gly Gin Tyr Asp Asn Thr He He Leu 

340 345 350 

Phe Thr Ser Asp Asn Gly Ala Val He Asp Gly Pro Leu Pro Leu Asn 

355 360 365 

Gly Ala Gin Lys Gly Tyr Lys Ser Gin Thr Tyr Pro Gly Gly Thr His 

370 " " 375 380 

Thr Pro Met Phe Met Trp Trp Lys Gly Lys Leu Gin Pro Gly Asn Tyr 
385 390 395 400 

Asp Lys Leu He Ser Ala Met Asp Phe Tyr Pro Thr Ala Leu Asp Ala 

405 410 415 

Ala Asp He Ser He Pro Lys Asp Leu Lys Leu Asp Gly Val Ser Leu 

420 425 430 

Leu Pro Trp Leu Gin Asp Lys Lys Gin Gly Glu Pro His Lys Asn Leu 

435 440 445 

Thr Trp He Thr Ser Tyr Ser His Trp Phe Asp Glu Glu Asn He Pro 

450 455 460 

Phe Trp Asp Asn Tyr His Lys Phe Val Arg His Gin Ser Asp Asp Tyr 
465 * 470 475 480 

Pro His Asn Pro Asn Thr Glu Asp Leu Ser Gin Phe Ser Tyr Thr Val 

485 490 495 

Arg Asn Asn Asp Tyr Ser Leu Val Tyr Thr Val Glu Asn Asn Gin Leu 

500 505 510 

Gly Leu Tyr Lys Leu Thr Asp Leu Gin Gin Lys Asp Asn Leu Ala Ala 

515 520 525 

Ala Asn Pro Gin Val Val Lys Glu Met Gin Gly Val Val Arg Glu Phe 

530 535 540 

He Asp Ser Ser Gin Pro Pro Leu Ser Glu Val Asn Gin Glu Lys Phe 
545 550 555 560 

Asn Asn He Lys Lys Ala Leu Ser Glu Ala Lys 
565 570 

<210> 407 
<211> 274 
<212> PRT 

<213> Escherichia coli 
<400> 407 

Met Thr Glu Phe Thr Thr Leu Leu Gin Gin Gly Asn Ala Trp Phe Phe 

1 5 10 15 

He Pro Ser Ala He Leu Leu Gly Ala Leu His Gly Leu Glu Pro Gly 

20 25 30 

His Ser Lys Thr Met Met Ala Ala Phe He He Ala He Lys Gly Thr 

35 40 45 

He Lys Gin Ala Val Met Leu Gly Leu Ala Ala Thr He Ser His Thr 

50 55 60 

Ala Val Val Trp Leu He Ala Phe Gly Gly Met Val He Ser Lys Arg 
65 70 75 80 

Phe Thr Ala Gin Ser Ala Glu Pro Trp Leu Gin Leu He Ser Ala Val 

85 90 95 

He He He Ser Thr Ala Phe Trp Met Phe Trp Arg Thr Trp Arg Gly 
100 105 HO 
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Glu 


Arg 


Asn 


Trp 


Leu 


Glu 


Asn Met His Gly His 


Asp 


Tyr Glu 


His 


His 






115 








120 




125 






His 


His 


Asp 


His 


Glu 


His 


His His Asp His Gly 


His 


His His 


His 


His 




130 










135 


140 








Glu 


His 


Gly 


Glu 


Tyr 


Gin 


Asp Ala His Ala Arg Ala 


His Ala 


Asn 


Asp 


145 










150 


155 








160 


lie 


Lys 


Arg 


Arg 


Phe 


Asp 


Gly Arg Glu Val Thr Asn 


Trp Gin 


He 


Leu 








165 




170 






175 




Leu 


Phe 


Gly 


Leu 


Thr 


Gly 


Gly Leu He Pro Cys 


Pro 


Ala Ala 


He 


Thr 






180 






185 




190 






Val 


Leu 


Leu 


He 


Cys 


He 


Gin Leu Lys Ala Leu 


Thr 


Leu Gly 


Ala 


Thr 






195 








200 




205 






Leu 


Val 


Val 


Ser 


Phe 


Ser 


He Gly Leu Ala Leu 


Thr 


Leu Val 


Thr 


Val 




210 










215 


220 








Gly 


Val 


Gly 


Ala 


Ala 


He 


Ser Val Gin Gin Val 


Ala 


Lys Arg 


Trp Ser 


225 










230 


235 








240 


Gly 


Phe 


Asn 


Thr 


Leu 


Ala 


Lys Arg Ala Pro Tyr 


Phe 


Ser Ser 


Leu 


Leu 








245 




250 






255 




He 


Gly 


Leu 


Val 


Gly 


Val 


Tyr Met Gly Val His 


Gly 


Phe Met 


Gly 


He 






260 






265 




270 






Met 


Arg 





















<210> 408 
<211> 172 
<212> PRT 

<213> Escherichia coli 



<400> 408 



Met 


He 


Leu 


Lys 


Ser 


Ala 


He Ser Ala Asp 


Ser Leu 


Leu 


Ala 


Lys Asp 


1 








5 




10 








15 


Ala 


Phe 


Arg 


Ala 


Ser 


Phe 


His Leu His Phe 


Leu Arg 


Asn 


His 


Gly He 








20 






25 






30 




Thr 


Asn 


Lys 


He 


Ser 


Leu 


Val Ser Tyr He 


Val Trp 


Gin 


Glu 


Arg Tyr 






35 








40 




45 






Ala 


Thr 


Asp 


He 


Thr 


Asp 


Pro Gin Ser Gly Glu Phe 


Met 


Thr 


He Lys 




50 










55 


60 








Asn 


Lys 


Met 


Leu 


Leu 


Gly 


Ala Leu Leu Leu 


Val Thr 


Ser 


Ala 


Ala Trp 


65 










70 




75 






80 


Ala 


Ala 


Pro 


Ala 


Thr 


Ala 


Gly Ser Thr Asn 


Thr Ser 


Gly 


He 


Ser Lys 










85 




90 








95 


Tyr 


Glu 


Leu 


Ser 


Ser 


Phe 


He Ala Asp Phe 


Lys His 


Phe 


Lys 


Pro Gly 








100 






105 






110 




Asp 


Thr 


Val 


Pro 


Glu 


Met 


Tyr Arg Thr Asp Glu Tyr 


Asn 


He 


Lys Gin 






115 








120 




125 






Trp 


Gin 


Leu 


Arg Asn 


Leu 


Pro Ala Pro Asp 


Ala Gly 


Thr 


His 


Trp Thr 




130 










135 


140 








Tyr 


Met 


Gly 


Gly Ala 


Tyr 


Val Leu He Ser 


Asp Thr 


Asp Gly 


Lys He 


145 










150 




155 






160 


He 


Lys 


Ala 


Tyr Asp 


Gly 


Glu He Phe Tyr 


His Arg 
















165 




170 











<210> 409 
<211> 567 
<212> PRT 

<213> Escherichia coli 
<400> 409 

Met Ala Phe Val Ser Glu Arg Glu He Val Arg Lys He Phe Ser Lys 
15 10 15 
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Lys 


He 


Asp 


Phe 


Thr 


He 


Leu 


TV "1 _ 

Ala 








20 










Phe 


Leu 


Leu 


Cys 


Ser 


Gly 


Val 


Leu 






35 










4U 


Thr 


Lys 


Gly 


Asn 


Cys 


Tyr 


Glu 


Cys 




50 










55 




Gin 


Phe 


Tyr 


Phe 


Ser 


Leu 


Glu 


Thr 


65 








7U 






Ala 


Val 


Phe 


He 


Ala 


Ser 


Val 


Phe 










85 








Leu 


Thr 


Leu 


Leu 


Ala 


He 


Thr 


Trp 








100 










Ala 


Leu 


He 


His 


Ala 


Leu 


Asp Asn 






115 










120 


Asn 


He 


Leu 


Tyr 


Asn 


Leu 


Phe Gly 




130 










135 




Cys 


Leu 


Ser 


Asn 


Ser 


Leu 


Leu 


Phe 


145 










150 






Pro 


Met 


lie 


Leu 


Ser 


Ala 


Met 


lie 










165 








Ala 


He 


Leu 


lie 


Thr 


Ala 


Val 


lie 








180 










Val 


Glu 


He 


Glu 


Met 


Asp 


lie 


Ser 






195 










200 


Gly Val 


Lys 


Asp 


Asn 


Glu 


Glu 


Ser 




210 










215 




Thr Asp 


Thr 


Pro 


Thr 


Tyr 


Leu Asp 


225 










230 






Tyr Asn 


Asp 


Thr 


Gin 


Gly 


Leu 


Ser 










245 








Gly Cys 


Tyr 


Ala 


Leu 


Pro 


Asp 


Leu 








260 










Lys 


Lys 


Ser 


Phe 


Leu 


Asn 


Val 


Lys 






275 










280 


Pro 


Met 


Met 


Gly 


Phe 


He 


Gin 


Gly 




290 










295 




Ala 


Ala 


Ser 


Arg 


Leu 


Ser 


Leu 


Ser 


305 










310 






Ala 


Ser 


Ser 


Val 


Thr 


Asp 


Ser 


Gin 










"3 O C 

325 








Gin 


Leu 


Met 


Val 


Ala 


Phe 


Ala 


Phe 








340 










Leu 


His 


Asp 


Tyr 


Thr 


Tyr 


Asp 


He 






355 










360 


He 


Glu 


Asn 


His 


Val 


Ala 


Pro 


Leu 




370 










375 




Met 


Lys 


Cys 


Glu 


Tyr 


Gin 


Gin 


lie 


385 
















He 


Asn 


Ala 


Asn 


Tyr 


Leu 


Thr 


Gly 










4UO 








Asp 


He 


He 


Asn 


Tyr 


Asn 


Asn 


Ser 








/ion 










Phe 


Ala 


Phe 


Tyr 


Lys 


Lys 


Thr Tyr 






435 










440 


Asp 


He 


Ser 


Asn 


Gly 


Lys 


Leu 


Ser 




450 










455 




Gin 


Phe 


Ser 


He 


Asn 


Gly 


Lys 


His 


465 










470 






He 


He 


He 


Ser 


Glu 


Gly 


Ser 


Leu 



485 



Phe 


Phe Tyr 


TI- 
IlS 


Ser 


O A w 

ber 


lie 


rne 


ZD 








■3 n 

JU 






Pne 


Gin Tyr 


Pne 


Thr 


Ala 


TV 1 -j 

Ala 


rne 








4 5 








Ser 


Met Lys 


Leu 


Asp 


Tyr 


lie 


Lys 






oU 










Ala 


Trp Tyr 


Leu 


lie 


Ser 


TV I _ 

Ala 


tf_ 1 

val 




75 










O A 


He 


Gin HIS 


Arg 


lie 


Lys 


TV 1 rs 

Aia 


Tyr 




OA 












He 


Val Leu 


Thr 


He 


Thr 


Asp 


val 


105 








110 






lie 


Ala Met 


Asn 


Asn 


lie 


Leu 


Leu 








125 








TV T — 

Ala 


lie Leu 


Leu 


Ser 


Leu 


rne 


Met 






140 










His 


Leu Asn 


Lys 


lie Lys 


His 


He 




ICC 

lob 










1 GA 

1 bU 


Pro 


Leu Val 


Ser 


Ala 


lie 


Tl » 

lie 


He 




170 








175 




Tyr 


Leu Leu 


Phe 


Ala 


Arg 


Gin 


TV 1 — 

Ala 


185 








190 






Glu 


Gly Ser 


Asp 


He 


Ala 


Tyr 


Val 








205 








Phe 


Gly Phe 


Leu 


Asn Asp 


Lys 


Lys 






O 1 r\ 

220 










Val 


lie Lys 


Asn 


Gly Ser 


Leu 


He 




I Jo 










0 /1 A 


Gly 


Ala Asp 


lie 


Tyr 


lie 


TY 1 

val 


ber 




O £ A 








ZOO 




Leu 


Arg Asn 


Val 


Pro 


Leu 


Asp 


7v 1 

Aia 


o c c 

265 








270 






Lys 


Leu Glu 


He 


Thr 


Gin 


Lys 


Leu 








285 








CjIU 


Ser Ala 


Asp 


Val 


Met 


Pro 


Lys 






JUU 










Lys 


Gin Asp 


ASp 


Lys 


Phe 


Met 


Leu 
















lie 


T • mm, nWn 

Lys rne 


Lys 


Ser 


Asn 


Asn 


Til a 

Aia 




■j *a rv 








TIC 
JJJ 




Met 


Pro lie 


1 nr 


Thr 


Asn 


biy 


lie 


J4b 








350 






He 


He Asn 


Asp 


Lys 


Lys 


Tyr 


Lys 








365 








Ser 


Arg Leu 


Asp 


Lys 


Asn 


Lys 


Lys 
















Ser 


Asp Leu 


Thr 


Asn 


Thr 


Tyr 


TV asm 

Asn 














A A A 


Phe 


Leu Leu 


val 


Leu 


Lys 


Pro 


Asp 




41U 








410 




Pro 


Ser Val 


Leu 


Leu 


Lys 


1 nr 


Asp 


h £ 3 








430 






Gin 


Lys Leu 


Asp 


Lys 


He 


Tyr 


Asp 








445 








Ser 


Leu Arg 


Ala 


Thr Gly 


lie 


Ser 






4 60 










Leu 


Ser Leu 


Arg 


Pro 


Glu 


Ser 


Glu 




475 










480 


Tyr 


Gly Leu 


Val 


Asn Lys 


Ser 


Lys 




490 








495 
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Lys lie Lys lie Tyr Gly Thr Ala 
500 

lie Met Asn Leu Arg Lys lie Thr 
515 520 
Phe Gly Ser Ser lie Met Asp lie 

530 535 
Leu Leu Ala lie Ser lie Lys Phe 
545 550 
Val Asn Glu Asn Leu Phe Leu 
565 



Asp Leu Val Phe Val Asp Asn Lys 
505 510 
Tyr Leu Gin Ser Lys Leu Glu He 
525 

Leu Lys Tyr He Phe Gly Leu Gly 
540 

He His Ser Tyr Phe Lys Asn Asp 
555 560 



<210> 410 
<211> 363 
<212> PRT 

<213> Escherichia coli 



<400> 410 
















Met 


Ser 


Asn 


Phe 


He 


Asn He 


His 


Val 


Leu 


He Ser His Ser Pro Ser 


1 








5 








10 


15 


Cys 


Leu 


Asn 


Arg 


Asp 


Asp Met Asn 


Met 


Gin 


Lys Asp Ala He Phe Gly 






20 








25 




30 


Gly 


Lys 


Arg 


Arg 


Val 


Arg He 


Ser 


Ser 


Gin Ser Leu Lys Arg Ala Met 


35 








40 






45 


Arg 


Lys 


Ser 


Gly 


Tyr 


Tyr Ala 


Gin 


Asn 


He 


Gly Glu Ser Ser Leu Arg 




50 








55 








60 


Thr 


He 


His 


Leu 


Ala 


Gin Leu 


Arg 


Asp 


Val 


Leu Arg Gin Lys Leu Gly 


65 










70 








75 80 


Glu 


Arg 


Phe 


Asp 


Gin 


Lys He 


He 


Asp 


Lys 


Thr Leu Ala Leu Leu Ser 








85 








90 


95 


Gly 


Lys 


Ser 


Val 


Asp 


Glu Ala Glu Lys 


He 


Ser Ala Asp Ala Val Thr 




100 








105 




110 


Pro 


Trp 


Val 


Val 


Gly 


Glu He 


Ala 


Trp 


Phe Cys Glu Gin Val Ala Lys 




115 








120 






125 


Ala 


Glu 


Ala 


Asp 


Asn 


Leu Asp Asp 


Lys 


Lys 


Leu Leu Lys Val Leu Lys 




130 








135 








140 


Glu 


Asp 


He 


Ala 


Ala 


He Arg 


Val 


Asn 


Leu 


Gin Gin Gly Val Asp He 


145 








150 








155 160 


Ala 


Leu 


Ser 


Gly 


Arg 


Met Ala 


Thr 


Ser 


Gly Met Met Thr Glu Leu Gly 










165 








170 


175 


Lys 


Val 


Asp 


Gly 


Ala 


Met Ser 


He 


Ala 


His 


Ala He Thr Thr His Gin 






180 








185 




190 


Val 


Asp 


Ser 


Asp 


He 


Asp Trp 


Phe 


Thr 


Ala 


Val Asp Asp Leu Gin Glu 






195 








200 






205 


Gin 


Gly 


Ser 


Ala 


His 


Leu Gly Thr Gin 


Glu 


Phe Ser Ser Gly Val Phe 




210 








215 








220 


Tyr 


Arg 


Tyr 


Ala 


Asn 


He Asn 


Leu 


Ala 


Gin 


Leu Gin Glu Asn Leu Gly 


225 








230 








235 240 


Gly 


Ala 


Ser 


Arg 


Glu 


Gin Ala 


Leu 


Glu 


He 


Ala Thr His Val Val His 








245 








250 


255 


Met 


Leu 


Ala 


Thr 


Glu 


Val Pro 


Gly 


Ala 


Lys 


Gin Arg Thr Tyr Ala Ala 








260 








265 




270 


Phe 


Asn 


Pro 


Ala 


Asp 


Met Val 


Met 


Val 


Asn 


Phe Ser Asp Met Pro Leu 






275 






280 






285 


Ser 


Met 


Ala 


Asn 


Ala 


Phe Glu Lys 


Ala 


Val Lys Ala Lys Asp Gly Phe 




290 








295 








300 


Leu 


Gin 


Pro 


Ser 


He 


Gin Ala 


Phe 


Asn 


Gin 


Tyr Trp Asp Arg Val Ala 


305 










310 








315 " 320 


Asn 


Gly 


Tyr 


Gly 


Leu 


Asn Gly Ala Ala 


Ala 


Gin Phe Ser Leu Ser Asp 










325 








330 


335 


Val 


Asp 


Pro 


He 


Thr 


Ala Gin 


Val 


Lys 


Gin 


Met Pro Thr Leu Glu Gin 
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340 345 350 

Leu Lys Ser Trp Val Arg Asn Asn Gly Glu Ala 
355 360 

<210> 411 
<211> 121 
<212> PRT 

<213> Escherichia coli 
<400> 411 

Met Met Ser Thr Thr Leu Phe Lys Asp Phe Thr Phe Glu Ala Ala His 

15 10 15 

Arg Leu Pro His Val Pro Glu Gly His Lys Cys Gly Arg Leu His Gly 

20 25 30 

His Ser Phe Met Val Arg Leu Glu lie Thr Gly Glu Val Asp Pro His 

35 40 45 

Thr Gly Trp He He Asp Phe Ala Glu Leu Lys Ala Ala Phe Lys Pro 

50 * 55 60 

Thr Tyr Glu Arg Leu Asp His His Tyr Leu Asn Asp He Pro Gly Leu 
65 70 75 80 

Glu Asn Pro Thr Ser Glu Val Leu Ala Lys Trp He Trp Asp Gin Val 

85 90 95 

Lys Pro Val Val Pro Leu Leu Ser Ala Val Met Val Lys Glu Thr Cys 

100 105 HO 

Thr Ala Gly Cys He Tyr Arg Gly Glu 
115 120 

<210> 412 
<211> 433 
<212> PRT 

<213> Escherichia coli 
<400> 412 

Met Leu Cys Ser Gly Tyr Leu Gly His Asp Met Glu Asp Asp Cys Asp 

1 5 10 15 

He He He He Gly Ala Gly He Ala Gly Thr Ala Cys Ala Leu Arg 

20 25 30 

Cys Ala Arg Ala Gly Leu Ser Val Leu Leu Leu Glu Arg Ala Glu He 

35 40 4 5 

Pro Gly Ser Lys Asn Leu Ser Gly Gly Arg Leu Tyr Thr His Ala Leu 

50 " 55 60 

Ala Glu Leu Leu Pro Gin Phe His Leu Thr Ala Pro Leu Glu Arg Arg 
65 70 75 80 

He Thr His Glu Ser Leu Ser Leu Leu Thr Pro Asp Gly Val Thr Thr 

85 90 95 

Phe Ser Ser Leu Gin Pro Gly Gly Glu Ser Trp Ser Val Leu Arg Ala 

100 105 110 

Arg Phe Asp Pro Trp Leu Val Ala Glu Ala Glu Lys Glu Gly Val Glu 

115 * 120 125 

Cys He Pro Gly Ala Thr Val Asp Ala Leu Tyr Glu Glu Asn Gly Arg 

130 135 140 

Val Cys Gly Val lie Cys Gly Asp Asp He Leu Arg Ala Arg Tyr Val 
145 150 155 160 

Val Leu Ala Glu Gly Ala Asn Ser Val Leu Ala Glu Arg His Gly Leu 

165 170 175 

Val Thr Arg Pro Ala Gly Glu Ala Met Ala Leu Gly He Lys Glu Val 

180 185 190 

Leu Ser Leu Glu Thr Ser Ala He Glu Glu Arg Phe His Leu Glu Asn 

195 200 205 

Asn Glu Gly Ala Ala Leu Leu Phe Ser Gly Arg He Cys Asp Asp Leu 
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210 










215 




220 


Pro 


Gly 


Gly 


Ala 


Phe 


Leu Tyr 


Thr Asn Gin 


Gin Thr Leu Ser Leu Gly 


225 








230 






235 240 


lie 


Val 


Cys 


Pro 


Leu 


Ser 


Ser 


Leu Thr Gin 


Ser Arg Val Pro Ala Ser 








245 






250 


255 


Glu 


Leu 


Leu 


Thr 


Arg 


Phe 


Lys 


Ala His Pro 


Ala Val Arg Pro Leu He 








260 








265 


270 


Lys 


Asn 


Thr 


Glu 


Ser 


Leu 


Glu 


Tyr Gly Ala 


His Leu Val Pro Glu Gly 




275 










280 


285 


Gly 


Leu 


His 


Ser 


Met 


Pro 


Val 


Gin Tyr Ala 


Gly Asn Gly Trp Leu Leu 


290 










295 




300 


Val 


Gly 


Asp 


Ala 


Leu 


Arg 


Ser 


Cys Val Asn 


Thr Gly He Ser Val Arg 


305 








310 






315 320 


Gly 


Met 


Asp 


Met 


Ala 


Leu 


Thr 


Gly Ala Gin 


Ala Ala Ala Gin Thr Leu 






325 






330 


335 


He 


Ser 


Ala 


Cys 


Gin 


His Arg 


Glu Pro Gin 


Asn Leu Phe Pro Leu Tyr 








340 








345 


350 


His 


His 


Asn 


Val 


Glu 


Arg 


Ser 


Leu Leu Trp 


Asp Val Leu Gin Arg Tyr 






355 










360 


365 


Gin 


His 


Val 


Pro 


Ala 


Leu 


Leu 


Gin Arg Pro 


Gly Trp Tyr Arg Thr Trp 




370 










375 




380 


Pro 


Ala 


Leu 


Met 


Gin 


Asp 


He 


Ser Arg Asp 


Leu Trp Asp Gin Gly Asp 


385 










390 






395 400 


Lys 


Pro 


Val 


Pro 


Pro 


Leu Arg 


Gin Leu Phe 


Trp His His Leu Arg Arg 










405 






410 


415 


His 


Gly 


Leu 


Trp 


His 


Leu 


Ala 


Gly Asp Val 


He Arg Ser Leu Arg Cys 








420 








425 


430 


Leu 



















<210> 413 
<211> 98 
<212> PRT 

<213> Escherichia coli 



<400> 413 



Met 


Ala 


Ser 


Gly Gly Arg Cys 


Tyr 


1 






5 




Arg 


Asn 


Leu 


Trp Arg Val Ala 


Asp 








20 




Ser 


Val 


Glu 


Arg Gin Thr Ala 


Glu 






35 




40 


Gly Leu 


Phe 


Ser Leu Thr Pro 


Glu 




50 




55 




Ser 


Cys 


Leu 


Glu Cys Gly Thr 


Cys 


65 






70 




Leu 


Gin 


Gin 


Trp Arg Tyr Pro 


Pro 








85 




Phe 


Gly 









Gin Glu Ser Ala Met Ser Val Ala 

10 15 
Ala Pro His He Val Pro Ala Asp 
25 30 
Arg Leu He Asn Ala Cys Pro Ala 
45 

Gly Asn Leu Arg He Asp Tyr. Arg 
60 

Arg Leu Leu Cys Asp Glu Ser Thr 

75 80 
Ser Gly Phe Gly He Thr Tyr Arg 
90 95 



<210> 414 
<211> 191 
<212> PRT 

<213> Escherichia coli 
<400> 414 

Met Pro Leu Leu His Leu Leu Arg Gin Asn Pro Val He Ala Ala Val 

15 10 15 

Lys Asp Asn Ala Ser Leu Gin Leu Ala He Asp Ser Glu Cys Gin Phe 
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90 






25 










30 






Tin Car 
lie oci 


Ua 1 

val 


Leu 


T* V7T" 

iyr 


1 w &cn Tip 
o x y no it x x c 


Cys 


Thr 


He 


Ser 


Asn 


He 


Val 


Lys 




35 






40 










45 








Lys lie 


Lys 


Hen 
nsn 


Al A 
nld 


ox y iijo lyi 


Ala 

nx a 


Phe 


Tie 


uic 
mo 


Val 


Asp 


Leu 


Leu 


















60 










pi it rM */ 

oiu oiy 


nl -a 




Asn 


t V c Glu Val 


Val 


He 


Gin 


Phe 


Leu 


Lys 


Leu 


Val 










70 






75 










80 


inr biu 


nla 


nop 


Gl V 


Tie Tie Ser 

XXC IXC JC1 


Thr 


Lys 


Ala 


Ser 


Met 


Leu 


Lys 


Ala 








as 






90 










95 




Ala Arg 


nld 


uiu 


uiy 


PVie Php Pwc 


He 


His 


Arg 


Leu 


Phe 


He 


Val 


Asp 






1 00 






105 










110 






oer x±e 


oer 


true 




Acn Tip 31 c n 
no 11 x x c no 


x*y 9 


Gin 


Val 


Ala 


Gin 


Ser 


Asn 


Pro 




11 J 






120 










125 








Asp Cys 


He 


Glu 


He 


Leu Pro Gly 


Cys 


Met 


Pro 


Lys 


Val 


Leu 


Gly 


Trp 


130 








135 








140 










Val Thr 


Glu 


-uy o 


He 


Ara Gin Pro 


Leu 


lie 


Ala 


Glv 


Glv 

vjxy 


Leu 


Val 


Cys 


145 








150 






155 










160 


Asp Glu 


Glu 


nop 




J\m Ren Al A 
n.x y noil nxa 


He 


Asn 


Ala 


Glv 
oxy 


Val 


Val 


Ala 


Leu 








165 






170 










175 




Ser Thr 


Thr 


Asn 


Thr 


Gly Val Trp. 


Thr 


Leu 


Ala 


Lys 


Lys 


Leu 


Leu 








180 






185 










190 






<210> 415 
























<211> 134 
























<212> PRT 
























<213> Escherichia coli 


















<400> 415 
























Met Phe 


Met 


Thr 


Trp 


Glu Tyr Ala 


Leu 


He 


Glv 
uxy 


Leu 


Val 


Val 


Glv 

\jrxy 


lie 


1 






5 






10 










15 




lie He 


Gly 


Ala 


Val 


Ala Met Arg 


Phe 


Glv 

uxy 


Asn 


Arg 


Lys 


Leu 


Arg 


Gin 






20 






25 










30 






Gin Gin 


Ala 


Leu 


Gin 


Tyr Glu Leu 


Gl n 

VJ1U 


■Liy o 


Asn 


x»y o 


Ala 


Glu 


Leu 


Asp 




35 






40 










45 








Glu Tyr 


Arg 


Glu 


Glu 


Leu Val Ser 


His 


Phe 


Ala 


Arg 


Ser 


Ala 


Glu 


Leu 


50 








55 








60 










Leu Asp 


Thr 


Met 


Ala 


His Asp Tyr 


Arg 


Gin 


Leu 


Tvr 
j 


Gin 


His 


Met 


Ala 


65 








70 






75 










80 


Lys Ser 


•Ser 


Ser 


Ser 


Leu Leu Pro 


Glu 


Leu 


Ser 


Ala 


Glu 


Ala 


Asn 


Pro 






85 






90 










95 




Phe Arg 


Asn 


Arg 


Leu 


Ala Glu Ser 


Glu 


Ala 


Ser 


Asn 


Asp 


Gin 


Ala 


Pro 






100 






105 










110 






Val Gin 


Met 


Pro 


Arg 


Asp Tyr Ser 


Glu 


Glv 


Ala 


Ser 


Gly 


Leu 


Leu 


Arg 




115 






120 










125 








Thr Gly 


Ala 


Lys 


Arg 


Asp 


















130 


























<210> 416 
























<211> 455 
























<212> PRT 
























<213> Escherichia coli 


















<400> 416 
























Met Lys 


Lys 


Gin 


Thr 


Gin Leu Leu 


Ser 


Ala 


Leu 


Ala 


Leu 


Ser 


Val 


Gly 


1 




5 






10 










15 




Leu Thr 


Leu 


Ser 


Ala 


Ser Phe Gin 


Ala 


Val 


Ala 


Ser 


He 


Pro 


Gly 


Gin 






20 






25 










30 






Val Ala 


Asp 


Gin 


Ala 


Pro Leu Pro 


Ser 


Leu 


Ala 


Pro 


Met 


Leu 


Glu 


Lys 




35 






40 










45 








Val Leu 


Pro 


Ala 


Val 


Val Ser Val 


Arg 


Val 


Glu 


Gly 


Thr 


Ala 


Ser 


Gin 
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50 






55 






Gly 


Gin 


Lys 


He Pro 


Glu Glu 


Phe 


Lys 


65 








70 






Pro 


Asp 


Gin 


Pro Ala 


Gin Pro 


Phe 


Glu 








85 








lie 


Asn 


Ala 


Ser Lys Gly Tyr 


Val 


Leu 








100 






105 


Gin 


Ala 


Gin 


Lys He 


Ser He 


Gin 


Leu 






115 






120 




Ala 


Lys 


Leu 


He Gly Ser Asp 


Asp 


Gin 




130 






135 






He 


Gin 


Asn 


Pro Ser 


Lys Leu 


Thr 


Gin 


145 








150 






Lys 


Leu 


Arg 


Val Gly Asp Phe 


Ala 


Val 








165 








Leu 


Gly 


Gin 


Thr Ala 


Thr Ser 


Gly 


He 








180 






185 


Gly 


Leu 


Asn 


Leu Glu Gly Leu 


Glu 


Asn 






195 






200 




He 


Asn 


Arg 


Gly Asn Ser Gly 


Gly 


Ala 




210 






215 






Leu 


He 


Gly 


He Asn 


Thr Ala 


He 


Leu 


225 








230 






Gly 


He 


Gly 


Phe Ala 


He Pro 


Ser 


Asn 








245 








Gin 


Leu 


He 


Asp Phe Gly Glu 


lie 


Lys 








260 






265 


Gly 


Thr 


Glu 


Met Ser 


Ala Asp 


lie 


Ala 






275 






280 




Gin 


Arg 


Gly 


Ala Phe 


Val Ser 


Glu 


Val 




290 






295 






Lys 


Ala 


Gly 


Val Lys Ala Gly 


Asp 


He 


305 








310 






Pro 


Leu 


Asn 


Ser Phe 


Ala Glu 


Leu 


Arg 








325 








Pro 


Gly 


Thr 


Lys Val 


Lys Leu 


Gly 


Leu 








340 






345 


Glu 


Val 


Glu 


Val Thr 


Leu Asp 


Thr 


Ser 






355 






360 




Glu 


Met 


He 


Thr Pro 


Ala Leu 


Glu 


Gly 




370 






375 






Leu 


Lys 


Asp 


Gly Gly 


Lys Gly 


He 


Lys 


385 








390 






Ser 


Pro 


Ala 


Ala Gin 


Ala Gly 


Leu 


Gin 








405 








Val 


Asn 


Arg 


Asp Arg 


Val Asn 


Ser 


He 








420 






425 


Ala 


Ala 


Lys 


Pro Ala 


He He 


Ala 


Leu 






435 






440 




Ser 


He 


Tyr 


Leu Leu 


Met Arg 








450 






455 







<210> 417 
<211> 355 
<212> PRT 

<213> Escherichia coli 
<400> 417 

Met Phe Val Lys Leu Leu Arg Ser Val 
1 5 



PCT/USOO/30950 







60 










Lys 


Phe 


Phe 


Gly 


Asp 


Asp 


Leu 




75 










80 


Gly 


Leu 


Gly 


Ser 


Gly 


Val 


He 


90 










95 




Thr 


Asn 


Asn 


His 


Val 


lie 


Asn 










110 






Asn 


Asp 


Gly 


Arg 


Glu 


Phe 


Asp 








125 








Ser 


Asp 


He 


Ala 


Leu 


Leu 


Gin 






140 










He 


Ala 


lie 


Ala 


Asp 


Ser 


Asp 




155 










160 


Ala 


Val 


Gly 


Asn 


Pro 


Phe 


Gly 


170 










175 




Val 


Ser 


Ala 


Leu 


Gly 


Arg 


Ser 










190 






Phe 


He 


Gin 


Thr 


Asp 


Ala 


Ser 








205 








Leu 


Leu 


Asn 


Leu 


Asn 


Gly 


Glu 






220 










Ala 


Pro 


Gly 


Gly 


Gly 


Ser 


Val 




235 










240 


Met 


Ala 


Arg 


Thr 


Leu 


Ala 


Gin 


250 










255 




Arg 


Gly 


Leu 


Leu 


Gly 


lie 


Lys 










270 






Lys 


Ala 


Phe 


Asn 


Leu 


Asp 


Val 








285 








Leu 


Pro 


Gly 


Ser 


Gly 


Ser 


Ala 






300 










lie 


Thr 


Ser 


Leu 


Asn 


Gly 


Lys 




315 










320 


Ser 


Arg 


lie 


Ala 


Thr 


Thr 


Glu 


330 










335 




Leu 


Arg 


Asn 


Gly 


Lys 


Pro 


Leu 










350 






Thr 


Ser 


Ser 


Ser 


Ala 


Ser 


Ala 








365 








Ala 


Thr 


Leu 


Ser 


Asp 


Gly 


Gin 






380 










lie 


Asp 


Glu 


Val 


Val 


Lys 


Gly 




395 










400 


Lys 


Asp 


Asp 


Val 


lie 


He 


Gly 


410 










415 




Ala 


Glu 


Met 


Arq 


Lys 


Val 


Leu 










430 






Gin 


lie 


Val 


Arg 


Gly 


Asn 


Glu 








445 








Ala 


lie 


Gly 


Leu 


He 


Val 


Gly 


10 










15 





-376- 



WO 01/34810 



PCT/US00/30950 



Ala He Leu Leu Val Ala Met Pro Ser Leu Arg Ser Leu Asn Pro Leu 

20 25 30 

Ser Thr Pro Gin Phe Asp Ser Thr Asp Glu Thr Pro Ala Ser Tyr Asn 

35 40 45 

Leu Ala Val Arg Arg Ala Ala Pro Ala Val Val Asn Val Tyr Asn Arg 

50 55 60 

Gly Leu Asn Thr Asn Ser His Asn Gin Leu Glu He Arg Thr Leu Gly 
65 70 75 80 

Ser Gly Val He Met Asp Gin Arg Gly Tyr He He Thr Asn Lys His 

85 90 95 

Val He Asn Asp Ala Asp Gin He He Val Ala Leu Gin Asp Gly Arg 

100 105 HO 

Val Phe Glu Ala Leu Leu Val Gly Ser Asp Ser Leu Thr Asp Leu Ala 

115 120 125 

Val Leu Lys He Asn Ala Thr Gly Gly Leu Pro Thr He Pro He Asn 

130 135 140 

Ala Arg Arg Val Pro His He Gly Asp Val Val Leu Ala He Gly Asn 
145 " 150 155 160 

Pro Tyr Asn Leu Gly Gin Thr He Thr Gin Gly He He Ser Ala Thr 

165 170 175 

Gly Arg He Gly Leu Asn Pro Thr Gly Arg Gin Asn Phe Leu Gin Thr 

180 185 190 

Asp Ala Ser He Asn His Gly Asn Ser Gly Gly Ala Leu Val Asn Ser 

195 200 . 205 

Leu Gly Glu Leu Met Gly He Asn Thr Leu Ser Phe Asp Lys Ser Asn 

210 215 220 

Asp Gly Glu Thr Pro Glu Gly He Gly Phe Ala He Pro Phe Gin Leu 
225 230 235 240 

Ala Thr Lys He Met Asp Lys Leu He Arg Asp Gly Arg Val He Arg 

245 250 255 

Gly Tyr He Gly He Gly Gly Arg Glu He Ala Pro Leu His Ala Gin 

260 265 270 

Gly Gly Gly He Asp Gin Leu Gin Gly He Val Val Asn Glu Val Ser 

275 280 285 

Pro Asp Gly Pro Ala Ala Asn Ala Gly He Gin Val Asn Asp Leu He 

290 295 300 

He Ser Val Asp Asn Lys Pro Ala He Ser Ala Leu Glu Thr Met Asp 
305 ~ 310 315 320 

Gin Val Ala Glu He Arg Pro Gly Ser Val He Pro Val Val Val Met 

325 330 335 

Arg Asp Asp Lys Gin Leu Thr Leu Gin Val Thr He Gin Glu Tyr Pro 
340 345 350 

Ala Thr Asn 
355 



<210> 418 
<211> 118 
<212> PRT 

<213> Escherichia coli 
<400> 418 

Met Ala Arg He Ala Gly He Asn 

1 5 
He Ala Leu Thr Ser He Tyr Gly 
20 

lie Leu Ala Ala Ala Gly He Ala 

35 40 
Ser Glu Gly Gin He Asp Thr Leu 

50 55 
Val Glu Gly Asp Leu Arg Arg Glu 



He Pro Asp His Lys His Ala Val 

10 15 
Val Gly Lys Thr Arg Ser Lys Ala 
25 * 30 
Glu Asp Val Lys He Ser Glu Leu 
45 

Arg Asp Glu Val Ala Lys Phe Val 
60 

He Ser Met Ser He Lys Arg Leu 
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65 70 


75 






80 


Met Asp Leu Gly Cys Tyr Arg 


Gly Leu Arg His Arg Arg 


Gly 


Leu 


Pro 


85 


90 




95 




Val Arg Gly Gin Arg Thr Lys 


Thr Asn Ala Arg Thr Arg 


Lys 


Gly 


Pro 


100 


105 


110 







Arg Lys Pro lie Lys Lys 
115 



<210> 419 
<211> 129 
<212> PRT 

<213> Escherichia coli 



<400> 419 



Met 


Ala 


Lys 


Ala 


Pro 


He Arg 


Ala Arg Lys Arg Val Arg 


Lys 


Gin 


Val 


1 








5 






10 




15 




Ser 


Asp 


Gly 


Val 


Ala 


His 


He 


His Ala Ser Phe Asn Asn 


Thr 


He 


Val 




20 








25 


30 






Thr 


He 


Thr 


Asp 


Arg 


Gin 


Gly 


Asn Ala Leu Gly Trp Ala 


Thr 


Ala 


Gly 






35 










40 45 








Gly 


Ser 


Gly 


Phe 


Arg 


Gly 


Ser 


Arg Lys Ser Thr Pro Phe 


Ala 


Ala 


Gin 




50 










55 


60 








Val 


Ala 


Ala 


Glu 


Arg 


Cys 


Ala 


Asp Ala Val Lys Glu Tyr 


Gly 


He 


Lys 


65 










70 




75 






80 


Asn 


Leu 


Glu 


Val 


Met 


Val 


Lys 


Gly Pro Gly Pro Gly Arg 


Glu 


Ser 


Thr 










85 






90 




95 




He 


Arg 


Ala 


Leu 


Asn 


Ala 


Ala 


Gly Phe Arg He Thr Asn 


He 


Thr 


Asp 






100 








105 


110 






Val 


Thr 


Pro 


He 


Pro 


His 


Asn 


Gly Cys Arg Pro Pro Lys 


Lys 


Arg 


Arg 



115 120 125 



Val 



<210> 420 
<211> 206 
<212> PRT 

<213> Escherichia coli 



<400> 420 



Met 


Ala 


Arg 


Tyr 


Leu 


Gly Pro 


Lys 


Leu Lys Leu 


Ser Arg 


Arg 


Glu 


Gly 


1 








5 






10 






15 




Thr 


Asp 


Leu 


Phe 


Leu 


Lys Ser 


Gly 


Val Arg Ala 


He Asp 


Thr 


Lys 


Cys 








20 








25' 




30 






Lys 


He 


Glu 


Gin 


Ala 


Pro Gly 


Gin 


His Gly Ala Arg Lys 


Pro 


Arg 


Leu 




35 








40 




45 








Ser 


Asp 


Tyr 


Gly 


Val 


Gin Leu 


Arg Glu Lys Gin 


Lys Val 


Arg 


Arg 


He 




50 








55 






60 








Tyr 


Gly 


Val 


Leu 


Glu 


Arg Gin 


Phe Arg Asn Tyr 


Tyr Lys 


Glu 


Ala 


Ala 


65 










70 




75 








80 


Arg 


Leu 


Lys 


Gly 


Asn 


Thr Gly 


Glu 


Asn Leu Leu 


Ala Leu 


Leu 


Glu 


Gly 








85 






90 






95 




Arg 


Leu 


Asp 


Asn 


Val 


Val Tyr 


Arg 


Met Gly Phe 


Gly Ala 


Thr 


Arg 


Ala 








100 








105 




110 






Glu 


Ala 


Arg 


Gin 


Leu 


Val Ser 


His 


Lys Ala He 


Met Val 


Asn 


Gly 


Arg 






115 








120 




125 








Val 


Val 


Asn 


He 


Ala 


Ser Tyr 


Gin 


Val Ser Pro Asn Asp 


Val 


Val 


Ser 




130 








135 






140 








He 


Arg 


Glu 


Lys 


Ala 


Lys Lys 


Gin 


Ser Arg Val 


Lys Ala 


Ala 


Leu 


Glu 


145 










150 




155 








160 


Leu 


Ala 


Glu 


Gin 


Arg 


Glu Lys 


Pro 


Thr Trp Leu 


Glu Val 


Asp 


Ala 


Gly 
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165 

Lys Met Glu Gly Thr Phe Lys Arg 
180 

Ala Asp lie Asn Glu His Leu lie 
195 200 



170 175 
Lys Pro Glu Arg Ser Asp Leu Ser 
185 190 
Val Glu Leu Tyr Ser Lys 
205 



<210> 421 
<211> 329 
<212> PRT 

<213> Escherichia coli 



<400> 421 



Met 


Gin 


Gly Ser 


va J. 


Thy 

i nr 


Glu Phe Leu Lys Pro Arg 


Leu Val Asd He 


1 








c 
D 




10 




15 


Glu 


Gin 


Val 


Ser 


Ser 


inr 


His Ala Lys Val Thr 


Leu 


Gl u Pro Lph Glu 








20 






25 




30 


Arg 


Gly 


Phe 


Gly 


HIS 


i nr 


Leu Gly Asn Ala Leu Arg 


Ara He Leu Leu 


35 








40 




45 


Ser 


Ser 


Met 


Pro 






Ala Val Thr Glu Val 


Glu 


He Asp Gly Val 




50 










55 


60 




Leu 


His 


Glu 


Tyr 


Cor 

ocx 


Thr 

X. ill. 


Lys Glu Gly Val Gin Glu 


Asp He Leu Glu 


65 










70 


75 




80 


lie 


Leu 


Leu 


Asn 


T on 


Lys 


Gly Leu Ala Val Arg 


Val 


Gin Gly Lys Asp 










R S 




90 




95 


Glu 


Val 


He 


Leu 


Thi- 

1 11X 


Leu 


Asn Lys Ser Gly He 


Gly 


Pro Val Thr Ala 








100 






105 




110 


Ala Asp 


He 


Thr 


nio 


Asp 


Gly Asp Val Glu He 


Val 


Lys Pro Gin His 






115 








120 




125 


Val 


He 


Cys 


His 


J-jtr Li 


Thr 


Asp Glu Asn Ala Ser 


He 


Ser Met Arg He 




130 








135 


140 




Lys 


Val 


Gin Arg 






Gly Tyr Val Pro Ala 


Ser 


Thr Arg He His 


145 










150 


155 




160 


Ser 


Glu 


Glu Asp 


Glu 


Ara 


Pro He Gly Arg Leu 


Leu 


Val Asp Ala Cys 










165 




170 




175 


Tyr 


Ser 


Pro 


Val 




Arg 


He Ala Tyr Asn Val Glu 


Ala Ala Arg Val 






180 






185 




190 


Glu 


Gin 


Arg 


Thr 


Asp 


Leu 


Asp Lys Leu Val He 


Glu 


Met Glu Thr Asn 






195 








200 




205 


Gly 


Thr 


He 


Asp 


Pro 


Glu 


Glu Ala He Arg Arg 


Ala 


Ala Thr He Leu 


210 










215 


220 




Ala 


Glu 


Gin 


Leu 


Glu 


Ala 


Phe Val Asp Leu Arg Asp 


Val Arg Gin Pro 


225 










230 


• 235 




240 


Glu 


Val 


Lys 


Glu 


Glu 


Lys 


Pro Glu Phe Asp Pro 


He 


Leu Leu Arg Pro 








245 




250 




255 


Val 


Asp 


Asp Leu 


Glu 


Leu 


Thr Val Arg Ser Ala 


Asn 


Cys Leu Lys Ala 






260 






265 




270 


Glu 


Ala 


He 


His 


Tyr 


He 


Gly Asp Leu Val Gin Arg 


Thr Glu Val Glu 






275 






280 




285 


Leu 


Leu 


Lys 


Thr 


Pro 


Asn 


Leu Gly Lys Lys Ser 


Leu 


Thr Glu He Lys 




290 








295 


300 




Asp. Val 


Leu 


Ala 


Ser 


Arg 


Gly Leu Ser Leu Gly Met 


Arg Leu Glu Asn 


305 










310 


315 




320 


Trp 


Pro 


Pro 


Ala 


Ser 


He 


Ala Asp Glu 












325 











<210> 422 
<211> 127 
<212> PRT 

<213> Escherichia coli 
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<400> 422 








Met 


Arg 


His 


Arg 


Lys 


Ser Gly Arg 


1 








5 




Arg 


Gin 


Ala 


Met 


Phe 


Arg Asn Met 








20 






lie 


He 


Lys 


Thr 


Thr 


Leu Pro Lys 






35 






40 


Glu 


Pro 


Leu 


He 


Thr 


Leu Ala Lys 




50 








55 


Leu 


Ala 


Phe 


Ala 


Arg 


Thr Arg Asp 


65 










70 


Asn 


Glu 


Leu 


Gly 


Pro 


Arg Phe Ala 










85 




He 


Leu 


Lys 


Cys 


Gly 


Phe Arg Ala 








100 






He 


Glu 


Leu 


Val 


Asp 


Arg Ser Glu 






115 






120 



<210> 423 
<211> 46 
<212> PRT 

<213> Escherichia coli 
<400> 423 

Met Lys Arg Thr Phe Gin Pro Ser 

1 5 
Gly Phe Arg Ala Arg Met Ala Thr 
20 

Arg Arg Arg Ala Lys Gly Arg Ala 
35 40 

<210> 424 
<211> 119 
<212> PRT 

<213> Escherichia coli 
<400> 424 



Met 


Val 


Lys 


Leu 


Ala 


Phe 


Pro 


Arg 


1 








5 








Gin 


Phe 


Thr 


Phe 
20 


Val 


Phe 


Gin 


Gin 


He 


Thr 


He 
35 


Leu 


Gly 


Arg 


Leu 


Asn 
40 


Leu 


Thr 
50 


Val 


Ala 


Lys 


Lys 


Asn 
55 


Val 


He 


Lys 


Arg 


Leu 


Thr 


Arg 


Glu 


Ser 


65 










70 






Pro 


Ala 


Met 


Asp 


Phe 
85 


Val 


Val 


Val 


Asp 


Asn 


Arg 


Ala 
100 


Leu 


Ser 


Glu 


Ala 


Cys 


Arg 


Leu 


Ala 


Arg 


Gly Ser 








115 













<210> 425 
<211> 591 
<212> PRT 

<213> Escherichia coli 
<400> 425 



Gin 


Leu Asn Arg Asn 


Ser 


Ser 


His 




10 






15 




Ala 


Gly Ser Leu Val 


Arg 


His 


Glu 


25 






30 






Ala 


Lys Glu Leu Arg 


Arg 


Val 


Val 






45 








Thr 


Asp Ser Val 


Ala 


Asn 


Arg Arg 




60 










Asn 


Glu He Val 


Ala 


Lys 


Leu 


Phe 




75 








80 


Ser 


Arg Ala Gly Gly 


Tvr 


Thr Arg 




90 






95 




Gly 


Asp Asn Ala 


Pro 


Met 




iyr 


105 






110 






Lys 


Ala Glu Ala 


Ala 


Ala 


Glu 








125 








Val 


Leu Lys Arg Asn 


Arg 


Ser 


His 




10 






15 




Lys 


Asn Gly Arg Gin 


Val 


Leu 


Ala 


25 






30 






Arg 


Leu Thr Val 


Ser 


Lys 










45 








Glu 


Leu Arg Leu 


Leu 


Thr 


Pro 


Ser 




10 






15 




Pro 


Gin Arg Ala 


Gly 


Thr 


Pro 


Gin 


25 






30 






Ser 


Leu Gly His 


Pro 


Arg 


He 


Gly 






45 








Arg 


Arg Ala His 


Glu 


Arg 


Asn 


Arg 




60 










Phe 


Arg Leu Arg 


Gin 


His 


Glu 


Leu 




75 








80 


Ala 


Lys Lys Gly Val 


Ala 


Asp 


Leu 




90 






95 




Leu 


Glu Lys Leu 


Trp 


Arg 


Arg 


His 


105 






110 
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Met lie Glu Lys Leu Arg Asn lie Ala lie lie Ala His Val Asp His 

1 5 10 15 

Gly Lys Thr Thr Leu Val Asp Lys Leu Leu Gin Gin Ser Gly Thr Phe 

20 25 30 

Asp Ser Arg Ala Glu Thr Gin Glu Arg Val Met Asp Ser Asn Asp Leu 

35 40 45 

Glu Lys Glu Arg Gly lie Thr lie Leu Ala Lys Asn Thr Ala lie Lys 

50 55 60 

Trp Asn Asp Tyr Arg lie Asn lie Val Asp Thr Pro Gly His Ala Asp 
65 " 70 75 80 

Phe Gly Gly Glu Val Glu Arg Val Met Ser Met Val Asp Ser Val Leu 

85 90 95 

Leu Val Val Asp Ala Phe Asp Gly Pro Met Pro Gin Thr Arg Phe Val 

100 105 110 

Thr Lys Lys Ala Phe Ala Tyr Gly Leu Lys Pro lie Val Val lie Asn 

115 ~ 120 125 

Lys Val Asp Arg Pro Gly Ala Arg Pro Asp Trp Val Val Asp Gin Val 

130 135 140 

Phe Asp Leu Phe Val Asn Leu Asp Ala Thr Asp Glu Gin Leu Asp Phe 
145 150 155 160 

Pro lie Val Tyr Ala Ser Ala Leu Asn Gly lie Ala Gly Leu Asp His 

165 170 175 

Glu Asp Met Ala Glu Asp Met Thr Pro Leu Tyr Gin Ala lie Val Asp 

180 185 190 

His Val Pro Ala Pro Asp Val Asp Leu Asp Gly Pro Phe Gin Met Gin 

195 * 200 205 

He Ser Gin Leu Asp Tyr Asn Ser Tyr Val Gly Val He Gly He Gly 

210 215 220 

Arg He Lys Arg Gly Lys Val Lys Pro Asn Gin Gin Val Thr He He 
225 230 235 240 

Asp Ser Glu Gly Lys Thr Arg Asn Ala Lys Val Gly Lys Val Leu Gly 

245 250 255 

His Leu Gly Leu Glu Arg He Glu Thr Asp Leu Ala Glu Ala Gly Asp 

260 265 270 

He Val Ala He Thr Gly Leu Gly Glu Leu Asn lie Ser Asp Thr Val 

275 280 285 

Cys Asp Thr Gin Asn Val Glu Ala Leu Pro Ala Leu Ser Val Asp Glu 

290 295 300 

Pro Thr Val Ser Met Phe Phe Cys Val Asn Thr Ser Pro Phe Cys Gly 
305 310 315 320 

Lys Glu Gly Lys Phe Val Thr Ser Arg Gin He Leu Asp Arg Leu Asn 

325 330 335 

Lys Glu Leu Val His Asn Val Ala Leu Arg Val Glu Glu Thr Glu Asp 

340 345 350 

Ala Asp Ala Phe Arg Val Ser Gly Arg Gly Glu Leu His Leu Ser Val 

355 360 365 

Leu He Glu Asn Met Arg Arg Glu Gly Phe Glu Leu Ala Val Ser Arg 

370 375 380 

Pro Lys Val He Phe Arg Glu He Asp Gly Arg Lys Gin Glu Pro Tyr 
385 390 395 400 

Glu Asn Val Thr Leu Asp Val Glu Glu Gin His Gin Gly Ser Val Met 

405 410 415 

Gin Ala Leu Gly Glu Arg Lys Gly Asp Leu Lys Asn Met Asn Pro Asp 

420 425 430 

Gly Lys Gly Arg Val Arg Leu Asp Tyr Val He Pro Ser Arg Gly Leu 

435 440 445 

He Gly Phe Arg Ser Glu Phe Met Thr Met Thr Ser Gly Thr Gly Leu 

450 455 460 

Leu Tyr Ser Thr Phe Ser His Tyr Asp Asp Val Arg Pro Gly Glu Val 
465 470 475 480 
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Gly Gin 


Arg 


Gin 


Asn 


Gly Val 


Leu lie 








485 






Val Ala 


Phe 


Ala 


Leu 


Phe Gly 


Leu Gin 






500 






505 


Gly His 


Gly 


Ala 


Glu 


Val Tyr 


Glu Gly 


515 








520 


Arg Ser 


Asn 


Asp 


Leu 


Thr Val 


Asn Cys 


530 








535 




Asn Met 


Arg 


Ala 


Ser 


Gly Thr 


Asp Glu 


545 








550 




lie Arg 


Met 


Thr 


Leu 


Glu Gin 


Ala Leu 






565 






Leu Val 


Glu 


Val 


Thr 


Pro Thr 


Ser He 






580 






585 



<210> 426 
<211> 756 
<212> PRT 

<213> Escherichia coli 
<400> 426 



Met 


Met 


Lys 


Val 


Leu 
5 


He Val 


Glu 


Ser 


1 

Trp 


Val 


Gly 


Asn 


Ala 


Val Glu 


Arg 


Leu 








20 








25 


Asn 


Val 


Thr 


Val 


He 


Lys Ser 


Thr 


Ser 






35 








40 




Leu 


Ser 


Ser 


Asn 


Glu 


Ala He 


Asp Cys 




50 








55 






Glu 


His 


Pro 


Asp 


Glu 


His Gin 


Asn 


Val 


65 










70 






His 


Glu 


Arg 


Gin 


Gin 


Asn Val 


Pro 


Val 










85 








Lys 


Ala 


Leu 


Ala 


Ala 


Met Asp 


Arg Asp 






100 








105 


Phe 


Ala 


Trp 


He 


Leu 


Glu Asp 


Thr 


Ala 






115 








120 




Val 


Ala 


Ala 


Met 


Thr 


Arg Tyr 


Arg 


Gin 




130 








135 






Ser 


Ala 


Leu 


Met 


Lys 


Tyr Ser 


Asp 


He 


145 










150 






Pro 


Gly 


His 


Gin 


Gly 


Gly Val 


Gly 


Phe 










165 








Phe 


Tyr 


His 


Asp 


Tyr 


Tyr Gly 


Glu 


Asn 








180 








185 


He 


Glu 


Arg 


Thr 


Ser 


Leu Gly 


Ser 


Leu 






195 








200 




Gly 


Glu 


Ser 


Glu 


Lys 


Tyr Ala 


Ala 


Arg 




210 








215 






Trp 


Ser 


Val 


Val 


Val 


Gly Thr 


Ser Gly 


225 










230 






Ala 


Cys 


Met 


Thr 


Asp 


Asn Asp 


Val 


Val 










245 








Lys 


Ser 


He 


Glu 


Gin 


Gly Leu 


Met 


Leu 








260 








265 


Met 


Val 


Pro 


Ser 


Arg 


Asn Arg 


Tyr Gly 






275 








280 




Gin 


Glu 


Met 


Gin 


Pro 


Glu Thr 


Leu 


Gin 




290 








295 






Leu 


Thr 


Lys 


Asp 


Lys 


Ala Gly 


Gin Lys 
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Ser 


Asn 


Gly Gin 


Gly 


Lys Ala 


490 








4 95 




Asp 


Arg 


Gly Lys 


Leu 


Phe 


Leu 








510 






Gin 


He 


He Gly 


He 


His 


Ser 






525 








Leu 


Thr 


Gly Lys 


Lys 


Leu 


Thr 






540 








Ala 


Val 


Val Leu 


Val 


Pro 


Pro 




555 








560 


Glu 


Phe 


He Asp 


Asp 


Asp Glu 


570 








575 




Arg 


lie 


Arg Lys 


Arg 


His 










590 






Glu 


Phe 


Leu His 


Gin 


Asp 


Thr 


10 








15 




Ala 


Asp 


Ala Leu 


Ser 


Gin 


Gin 








30 






Phe 


Asp 


Asp Gly 


Phe 


Ala 


He 






45 








Leu 


Met 


Phe Ser 


Tyr 


Gin 


Met 






60 








Arg 


Gin 


Leu He 


Gly 


Lys 


Leu 




75 








80 


Phe 


Leu 


Leu Gly 


Asp 


Arg Glu 


90 








95 




Leu 


Leu 


Glu Leu 


Val 


Asp Glu 








110 






Asp 


Phe 


He Ala 


Gly 


Arg Ala 






125 








Gin 


Leu 


Leu Pro 


Pro 


Leu 


Phe 






140 








His 


Glu 


Tyr Ser 


Trp 


Ala 


Ala 




155 








160 


Thr 


Lys 


Thr Pro 


Ala 


Gly Arg 


170 








175 




Leu 


Phe 


Arg Thr 


Asp 


Met 


Gly 








190 






Leu 


Asp 


His Thr 


Gly 


Ala 


Phe 






205 








Val 


Phe 


Gly Ala 


Asp 


Arg 


Ser 






220 








Ser 


Asn 


Arg Thr 


He 


Met 


Gin 




235 








240 


Val 


Val 


Asp Arg 


Asn 


Cys 


His 


250 








255 




Thr 


Gly 


Ala Lys 


Pro 


Val 


Tyr 






270 






He 


He 


Gly Pro 


He 


Tyr 


Pro 






285 








Lys 


Lys 


He Ser 


Glu 


Ser 


Pro 






300 








Pro 


Ser 


Tyr Cys 


Val 


Val 


Thr 
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305 



Asn 


Cys 


Th r 

i nr 


Tyr 


Leu 


Leu 


Glu 


Lys 








340 


Gly 


Tyr 


Ala 


Arg 






J J J 




Gly 


G1U 


Pro 


pi ., 
(a±y 




'inn 
370 






Tnr 


HIS 


Lys 


Leu 










Arg 




oiy 


Arg 


Met 


Met 


His 


Ala 








420 


Asp 


Val 


Ala 


vai 






4 J j 




Gin 


Glu 


val 


I le 




X R/"\ 
4 jU 






Leu 


Tyr 


i*ys 


blU 


4 DJ 








nsn 


JjyS 




vex 


Ala 


Asp 


Ala 


Pro 








500 


Met 


rj j 
HIS 


fro 








Rl r 




Trp 


Car 

oer 


McL 


Leu 










oiy 


OlU 


Asp 


oiy 


c « c 








rpw r 

i nr 


nia 


Trp 


Leu 


Phe 


Gin 


He 


Met 








580 


Gly 


Thr 


T mi 

L»eu 


vai 






ROC 

□ y j 




Asn 


xnr 


Pro 


Leu 




OlU 






Asp 


i nr 


Tyr 


Ala 

Aia 


COR 








Ala 


Trp 


Leu 


Lys 


Ser 


Gly 


Leu 


Pro 








660 


Tl- 
ilB 


vai 


Asp 


Asn 






£*7 R 
D / J 




Arg 


Tip 
X IS 


ai a 

nld 


2X1 a 










Leu 


Leu 


Ser 


Gly 


705 








Tyr 


Leu 


Arg 


Ser 


His 


Glu 


Thr 


Glu 








740 


Cys 


Val 


Lys 


Ala 






755 





<210> 427 
<211> 253 





310 






Asp 


Oiy 


Val 

vol 


L Y 5 


395 








i nr 




Asp 


A T~n 

nrg 




no I I 


Pro 


lie 








360 


Asp 


His 


Asn 


Gly 






j / j 




Leu 


Asn 


Ala 


Leu 




390 






oiy 


nld 


Tip 


Asn 


405 








inr 


i nr 




rro 






LTlC L. 


no£J 








440 


Asp 


Glu 


Ala 


Val 






ft JJ 




rne 


i nr 


nla 


TV 

Asp 




470 






Va 1 
val 


J. Ill 




Prn 


ft O J 








1 nr 


Lys 


T.on 


xicu 


ul U 




ixp 


His 








520 


Asp 


Pro 


He 


Lys 






535 
•j j j 




m n 

^J-L u 


T.on 
lieu 


Glu 


Glu 




550 








Arg 


nio 


oxy 


5fi5 










Leu 


file 


Cor 


• 

Asn 


Thr 

L ill 


Leu 










600 


Ala 


Gin 


Val 


Met 






£1 R 
OX J 




TV 

Asn 




oxy 


Tl O 
1 1c 




630 






biu 


Asn 


Asn 


Pro 


£4 R 








Vox 


nld 


UlU 


val 


Asn 


Val 


Glu 


Leu. 








680 


Asn 


Ser 


Val 


He 






695 




Glu 


Asn 


Phe 


Gly 




710 






Leu 


Gin 


Ser 


Trp 


725 








Gly 


Thr 


Glu 


He 







n 5 
ji j 


Tur 

iyr 


Aon 
no 11 


Ala Lys 




nn 

JJU 




i>eu 


Hi «. 


tlie nop 


14 5 






iyr 


Ala 
nld 


len Hie 

nsp nl9 


nu 


Thr 


Val Ph» 
vai ciixs 






380 


Ser 


Gin 


Ala Ser 






395 


Pho 
r lie 


C "V* 

JCl 


&rrr Pho 
nty true 




410 




Leu 


Tyr 


ala Tie 
nld lie 


*i C J 






oiy 


Asn 


jci oiy 


Asp 


pho 
rile 


niy bin 






460 


Gly 


Ser 


Trp Phe 






4 75 


uin 


rphr 

i nr 


Gly Lys 




4 QO 




Thr 
1 111 


Thr 

i nr 


Val m n 

vai oin 


JUJ 






oxy 


ITl lu 


T.\/^ Don 


vai 


Ser 


IXC JjCU 






540 


Thr 


Gly 


Val Pro 






555 

JJ j 


Tip 

lie 


ya l 
val 


Drn Thr 
trio 1111 




5*70 




Mot- 
wet 


oiy 


Val Thr 
Vol xni 


585 

J O J 






Oct 


Pha 

rue 


lay is rtiy 


Pro 


OXU 


Ton Va 1 
Ijcll vai 






620 


His 


Asp 


Leu Gly 






D J J 


oiy 


Ala 
Hid 


7A r n Ton 

ni y lieu 




650 

D J v 




Th r 

i nr 


Pro 


Arg Glu 


665 






val 


oci 


11C OlU 


Pro 


Tyr 


Pro Pro 






700 


Asp 


Lys 


Asn Ser 






715 


Asp 


His 


His Phe 




730 




He 


Asp 


Gly He 



745 
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320 



Glu 


Ala 


Oil 1 


Asp 






3^5 
j j j 




Glu 


Ala 


T rn 


Tyr 
xyi 




350 






Tyr Ala 


Mpf 


A rrr 

Arg 
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m a 

nXo 


Thr 

inr 


nib 


Qo r 

oer 


Tyr 


He 


His 


Val 








400 


Asn 


Gin 


Ala 


T \/r 
iyr 






415 

" i J 




Cys 


Ala 


Ser 


Asn 




430 






Leu 


Ser 


Leu 


Thr 


445 








Ala 


Met 


Ala 
nld 


Arg 


Phe 


Lys 


Pro 


Trp 








480 


Thr Tyr 


Asp 


Pho 






4 95 

*a -7 J 




Asp Cys 


Trr> 
lrp 


Val 
vai 




510 






He 


Pro 


Asp 


Asn 


525 








Ala 


Pro 


Glv 
oiy 


Met 


Ala 


Ala 


Leu 


Val 








560 


Arg 


Thr 


Thr 

i nr 


Asp 






575 
j / j 




Arg Gly 


T wo 
jjy o 


Trn 

irp 




590 






His 


Tyr 


Asp 


Ala 


605 








Glu 


Gin 


iyr 


Pro 
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<210> 436 
<211> 203 
<212> PRT 

<213> Escherichia coli 



<400> 436 



Met 


Ser 


Ser 


Ser 


Gin 


Glu 


Leu Arg 


1 








5 






Leu 


Met 


Glu 


Thr 


Val 


Phe Asp Ala 








20 








Ser 


He 


Glu 


Leu 


Ala 


Ala Arg Leu 






35 








40 


Asn 


Glu 


Leu 


Trp 


Glu 


Leu 


Lys Lys 




50 










55 


Tyr 


Thr 


Trp 


Arg 


Val 


Ala Asp Asn 


65 










70 




Gin 


Ala 


Glu 


Leu 


Pro 


Glu 


Glu He 










85 






Ser 


Glu 


Cys 


Asp 


Leu 


Thr 


Ala Thr 








100 








Ala Asp 


Glu 


Leu 


Ala 


Thr 


Leu Phe 






115 








120 


Ser 


Thr 


Leu 


Ala 


Met 


Ala 


He Ser 




130 










135 


Gin 


Gly 


Gly 


Lys 


Phe 


Arg 


Tyr Cys 


145 










150 




Glu 


Pro 


Lys 


Ala 


Ala 


Ser 


Val Ser 










165 






Ser 


Ala 


Cys 


His 


Gly 


Val 


Leu He 








180 








Thr Lys 


Asn 


Ser 


Ala 


Thr Lys Met 






195 








200 



Ser 


Asn 


Phe Tyr Arg 


Glu 


Lys 


7\ r~ 

Asn 




10 






15 




Leu 


Lys 


Ala Met Gly 


Lys 


Ala 


Tnr 


25 






30 






Asp 


He 


Ser Arg Glu 


Glu 


Val 


Leu 






45 








Ala 


Gly 


Phe Val Asp 


Lys 


Ser 


Ala 






60 








Asn 


Val 


Gin Gin Glu 


.Gin 


Pro 


Ala 






75 






80 


Thr 


Thr 


Ala Thr Val 


Ala 


Lys 


He 




90 






95 




He Glu Gin Arg Gly 


Pro 


Gin 


Thr 


105 






110 






Gly Thr Thr Ser Arg 


Lys 


Val 


Ala 






125 








Lys Gly Arg Leu He 


Arg Val 


Asn 






140 








He 


Pro 


Gly Asp Asn 


Leu 


Pro 


Ala 






155 






160 


Pro 


Leu 


Trp Leu Ser 


Ala 


Ser 


Ser 




170 






175 




He 


Thr 


Val He Thr 


Pro 


Ser 


Pro 


185 






190 






Pro 


Glu 


Asn 









<210> 437 
<211> 101 
<212> PRT 
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<211> 437 
<212> PRT 

<213> Escherichia coli 
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Val 


bin 


Arg 




250 










o c c 




Pne 


Ala 


Val 


Leu 


Val 


lie 


Leu 




265 










O ^ ft 
Z. IK) 






Gly 


Asn 


He 


He 


Pro 


Ala 


Gly 


Tyr 










285 








Lys 


Leu 


Thr 


Tyr Lys 


Asn 


bly 


val 








300 










Leu 


He 


Cys 


Pro 


Trp 


Lys 


Leu 


Met 






315 










"3 O ft 


Phe 


Leu Asp 


He 


He 


biy 


bly 


Met 




330 










■JQC 

J Jo 




Met 


Ala 


His 


Tyr 


Phe 


Val 


Val 


Met 


345 










350 






Leu 


Tyr 


Thr 


Ala 


Pro 


Gly 


Asp 


Tyr 










365 








Leu 


Thr 


Ala 


Phe 


Ser 


Val 


Thr 


Leu 








380 










Gly 


Lys 


Phe 


He 


His 


Phe 


Met 


Glu 




395 








400 


Val 


Gly Val 


He 


Val 


Ala 


Phe 


Ala 




410 










415 




Thr 


Thr 


Ala 


Glu 


Lys 


Thr 


Gly 


Glu 


425 










430 
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435 

<210> 441 
<211> 464 
<212> PRT 

<213> Escherichia coli 



<400> 441 



Met 


Leu 


Leu 


Leu 


Asp 


Ala Cys Ser Gin Met 


Cys 


Pro 


Ser Phe Arg Arg 


1 








5 


10 






15 


Phe 


Gin 


Thr 


Val 


Phe 


His Asn Ser Ser He 


Phe 


Leu 


Pro Tyr Trp Leu 








20 




25 






30 


Ala 


Thr 


Leu 


Val 


Ser 


Phe Arg Glu Thr Phe 


Gin 


Glu 


Glu Lys Leu Leu 






35 






40 






45 


Thr 


Met 


Lys 


Gly 


Ser 


Tyr Lys Ser Arg Trp Val 


He 


Val lie Val Val 




50 








55 




60 




Val 


He 


Ala 


Ala 


He 


Ala Ala Phe Trp Phe Trp Gin 


Gly Arg Asn Asp 


65 










70 


75 




80 


Ser 


Arg 


Ser 


Ala 


Ala 


Pro Gly Ala Thr Lys 


Gin 


Ala 


Gin Gin Ser Pro 








85 


90 






95 


Ala 


Gly 


Gly 


Arg 


Arg 


Gly Met Arg Ser Gly 


Pro 


Leu 


Ala Pro Val Gin 








100 




105 






110 


Ala 


Ala 


Thr 


Ala 


Val 


Glu Gin Ala Val Pro Arg 


Tyr 


Leu Thr Gly Leu 






115 






120 






125 


Gly 


Thr 


He 


Thr 


Ala 


Ala Asn Thr Val Thr 


Val 


Arg 


Ser Arg Val Asp 




130 








135 




140 




Gly 


Gin 


Leu 


He 


Ala 


Leu His Phe Gin Glu 


Gly Gin 


Gin Val Lys Ala 


145 










150 


155 




160 


Gly 


Asp 


Leu 


Leu 


Ala 


Glu He Asp Pro Ser 


Gin 


Phe 


Lys Val Ala Leu 






165 


170 






175 


Ala 


Gin 


Ala 


Gin 


Gly 


Gin Leu Ala Lys Asp Lys Ala 


Thr Leu Ala Asn 








180 




185 






190 


Ala 


Arg 


Arg 


Asp 


Leu 


Ala Arg Tyr Gin Gin 


Leu 


Ala 


Lys Thr Asn Leu 






195 






200 






205 


Val 


Ser 


Arg 


Gin 


Glu 


Leu Asp Ala Gin Gin 


Ala 


Leu 


Val Ser Glu Thr 




210 








215 




220 




Glu 


Gly 


Thr 


He 


Lys 


Ala Asp Glu Ala Ser 


Val 


Ala 


Ser Ala Gin Leu 


225 










230 


235 




240 


Gin 


Leu 


Asp 


Trp 


Ser 


Arg He Thr Ala Pro 


Val 


Asp 


Gly Arg Val Gly 










245 


250 






255 


Leu 


Lys 


Gin 


Val 


Asp 


Val Gly Asn Gin He 


Ser 


Ser 


Gly Asp Thr Thr 








260 




265 






270 


Gly 


He 


Val 


Val 


lie 


Thr Gin Thr His Pro 


He Asp 


Leu Val Phe Thr 




275 






280 






285 


Leu 


Pro 


Glu 


Ser 


Asp 


He Ala Thr Val Val 


Gin 


Ala 


Gin Lys Ala Gly 




290 








295 




300 




Lys 


Pro 


Leu 


Val 


Val 


Glu Ala Trp Asp Arg Thr Asn 


Ser Lys Lys Leu 


305 










310 


315 




320 


Ser 


Glu 


Gly 


Thr 


Leu 


Leu Ser Leu Asp Asn 


Gin 


He 


Asp Ala Thr Thr 










325 


330 






335 


Gly 


Thr 


He 


Lys 


Val 


Lys Ala Arg Phe Asn 


Asn 


Gin 


Asp Asp Ala Leu 






340 




345 






350 


Phe 


Pro 


Asn 


Gin 


Phe 


Val Asn Ala Arg Met 


Leu 


Val 


Asp Thr Glu Gin 






355 






360 






365 


Asn 


Ala 


Val 


Val 


He 


Pro Thr Ala Ala Leu 


Gin 


Met 


Gly Asn Glu Gly 




370 








375 




380 




His 


Phe 


Val 


Trp 


Val 


Leu Asn Ser Glu Asn 


Lys 


Val 


Ser Lys His Leu 


385 










390 


395 




400 


Val 


Thr 


Pro 


Gly 


He 


Gin Asp Ser Gin Lys Val Val 


He Arg Ala Gly 










405 


410 






415 
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lie Ser Ala 


Gly Asp Arg 


Val 


Val 


Thr 


Asp 


Gly 


lie 


Asp 


Arg 


Leu 


Thr 




420 






425 










430 






Glu Gly Ala 


Lys Val Glu 


Val 


Val 


Glu 


Ala 


Gin 


Ser 


Ala 


Thr 


Thr 


Pro 


435 






440 










445 








Glu Glu Lys 


Ala Thr Ser 


Arg 


Glu 


Tyr 


Ala 


Lys 


Lys 


Gly 


Ala 


Arg 


Ser 


450 




455 










460 











<210> 442 
<211> 1040 
<212> PRT 

<213> Escherichia coli 



<400> 442 



Met 


Gin 


Val 


Leu 


Pro 


Pro 


Ser 


Ser 


Thr 


Gly 


Gly 


Pro 


Ser 


Arg 


Leu 


Phe 


1 








5 










10 










15 




lie 


Met 


Arg 


Pro 


Val 


Ala 


Thr 


Thr 


Leu 


Leu 


Met 


Val 


Ala 


He 


Leu 


Leu 






20 










25 










30 






Ala 


Gly 


He 


He 


Gly 


Tyr 


Arg 


Ala 


Leu 


Pro 


Val 


Ser 


Ala 


Leu 


Pro 


Glu 






35 










40 










A C 

45 








Val 


Asp 


Tyr 


Pro 


Thr 


He 


Gin 


Val 


Val 


Thr 


Leu 


Tyr 


Pro 


Gly 


Ala 


Ser 




50 




















DO 










Pro 


Asp 


Val 


Met 


Thr 


Ser 


Ala 


vai 


inr 


Ala 


Pro 


Leu 


GlU 


Arg 


Gin 


pne 


65 










-J A 
/U 










O C 










a n 


Gly 


Gin 


Met 


Ser 


Gly 


Leu 


Lys 


Gin 


Met 


O A V 

ser 


ber 


Gin 


Ser 


Ser 


Gly 


Gly 










o c 
OD 










on 

yu 










Q C 

yb 




Ala 


Ser 


val 


lie 


inr 


T At* 

Leu 


Gin 


pne 


Gin 


Leu 


mr 


Leu 


fro 


Leu 


Asp 


vai 








1 Art 

100 










lUo 










1 1 A 

11U 






Ala 


Glu 


Gin 


Glu 


val 


Gin 


Ala 


Ala 


Tl- 


Asn 


Ala 


Ala 


T»l-i ■»- 

inr 


7\ t-» 

Asn 


Leu 


Leu 






115 










1Z0 










IOC 








Pro 


Ser 


Asp 


Leu 


Pro 


Asn 


Pro 


Pro 


vai 


Tyr 


Ser 


Lys 


Val 


Asn 


Pro 


Ala 




1 JO 




















i a n 
14U 










Asp 


Pro 


Pro 


He 


Met 


Thr 


Leu 


Ala 


vai 


Thr 


Ser 


inr 


Ala 


Met 


Pro 


Met 


145 










t c n 

150 










ICC 

155 










IbU 


Thr 


Gin 


Val 


GlU 


Asp 


Met 


val 


GlU 


mi. „ 

l nr 


Arg 


val 


Ala 


Gin 


Lys 


lie 


ber 










"ICC 

1 DO 










1 f U 










IOC 

1 / 3 




Gin 


lie 


Ser 


Gly 


val 


Gly 


Leu 


XT r» 1 

vai 


inr 


Leu 


Ser 


Gly 


Gly 


Gin 


Arg 


Pro 








loU 




















i yu 






Ala 


Val 


Arg 


IT- I 

Val 


Lys 


Leu 


Asn 


Aia 


Gin 


Aia 


lie 


Ala 


Ala 


Leu 


Gly 


lieu 






195 




















one 








1 XI J. 




VjX U 


Thr* 
L ilL 


vdi 


Arg 


1 l IX. 


nla 


Tip 
lie 


Thr 


uiy 


Ala 


J\ on 


vai 


Asn 


OCJL 




210 










215 










220 










Ala 


Lys 


Gly 


Ser 


Leu 


Asp 


Gly 


Pro 


Ser 


Arg 


Ala 


Val 


Thr 


Leu 


Ser 


Ala 


225 










230 










235 










240 


Asn 


Asp 


Gin 


Met 


Gin 


Ser 


Ala 


Glu 


Glu 


Tyr 


Arg 


Gin 


Leu 


He 


He 


Ala 










245 










250 










255 




Tyr 


Gin 


Asn 


Gly 


Ala 


Pro 


He 


Arg 


Leu 


Gly 


Asp 


Val 


Ala 


Thr 


Val 


Glu 








260 










265 










270 






Gin 


Gly 


Ala 


Glu 


Asn 


Ser 


Trp 


Leu 


Gly 


Ala 


Trp 


Ala 


Asn 


Lys 


Glu 


Gin 






275 










280 










285 








Ala 


He 


Val 


Met 


Asn 


Val 


Gin 


Arg 


Gin 


Pro 


Gly 


Ala 


Asn 


He 


He 


Ser 




290 










295 










300 










Thr 


Ala 


Asp 


Ser 


He 


Arg 


Gin 


Met 


Leu 


Pro 


Gin 


Leu 


Thr 


Glu 


Ser 


Leu 


305 










310 










315 










320 


Pro 


Lys 


Ser 


Val 


Lys 


Val 


Thr 


Val 


Leu 


Ser 


Asp 


Arg 


Thr 


Thr 


Asn 


He 










325 










330 










335 




Arg 


Ala 


Ser 


Val 


Asp 


Asp 


Thr 


Gin 


Phe 


Glu 


Leu 


Met 


Met 


Ala 


He 


Ala 








340 










345 










350 






Leu 


Val 


Val 


Met 


He 


He 


Tyr 


Leu 


Phe 


Leu 


Arg 


Asn 


He 


Pro 


Ala 


Thr 






355 










360 










365 








He 


He 


Pro 


Gly 


Val 


Ala 


Val 


Pro 


Leu 


Ser 


Leu 


He 


Gly 


Thr 


Phe 


Ala 
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370 

Val Met Val Phe 
385 

Leu Thr lie Ala 

Glu Asn lie Ser 
420 

Ala Leu Lys Gly 
435 

Phe Ser Leu He 
450 

Val Gly Arg Leu 
465 

Leu He Ser Ala 

Arg Met Leu Ser 
500 

Ala Ser Glu Lys 
515 

Leu Ala Lys Val 
530 

Ser Thr Leu Leu 
545 

Phe Phe Pro Val 

Pro Gin Ser Ser 
580 

Ala Asp Val He 
595 

Val Gly Val Asp 
610 

He Asn Leu Lys 
625 

He Ala Arg Leu 

Phe Leu Gin Pro 
660 

Thr Gin Tyr Gin 
675 

Thr Trp Val Pro 
690 

Ser Asp Val Ser 
705 

Asn Val Asp Arg 

Val Asp Asn Ala 
740 

He Tyr Thr Gin 
755 

Glu Asn Thr Pro 
770 

Ser Asp Gly Gly 
785 

Arg Phe Ala Pro 

Thr He Ser Phe 
820 

Gin Ala He Met 
835 

Thr Thr Gin Phe 



375 



Leu 


Asp 


Phe 


Ser 




390 






Thr 


Gly 


Phe 


Val 


405 








Arg 


Tyr 


He 


Glu 


Ala 


Glv 


Glu 


He 








440 


Ala 


Val 


Leu 


He 






455 




Phe 


Arg 


Glu 


Phe 




470 






Val 


Val 


Ser 


Leu 


485 








Gin 


Glu 


Ser 


Leu 


Met 


Phe 


Asp 


Arg 








520 


Leu 


Asn 


His 


Pro 






535 




Leu 


Ser 


Val 


Leu 




550 






Gin 


Asp 


Asn 


Gly 


565 








Ser 


Phe 


Ala 


Asn 


Leu 


Gin 


Asp 


Pro 








600 


Gly 


Thr 


Asn 


Pro 






615 




Pro 


Leu 


Asp 


Glu 




630 






Gin 


Thr 


Ala 


Val 


645 








Thr 


Gin 


Asp 


Leu 


Phe 


Thr 


Leu 


Gin 








680 


Gin 


Leu 


Met 


Glu 






695 




Ser 


Asp 


Trp 


Gin 




710 






Asp 


Ser 


Ala 


Ser 


725 








Leu 


Tyr 


Asn 


Ala 


Ala 


Asn 


Gin 


Tyr 








760 


Gly 


Leu 


Ala 


Ala 






775 




Val 


Val 


Pro 


Leu 




790 






Leu 


Ser 


He 


Asn 


805 








Asn 


Val 


Pro 


Asp 


Asp 


Thr 


Glu 


Lys 








840 


Gin 


Gly 


Ser 


Thr 



380 



He 


Asn 


Asn 


Leu 






395 




Val 


Asp 


Asp 


Ala 




410 






Lys 


Gly 


Glu 


Lys 


425 








Gly 


Phe 


Thr 


He 


Pm 


Leu 


Leu 










460 


Ala 


He 


Thr 


Leu 






475 




Thr 


Leu 


Thr 


Pro 




4 90 






Arg 


Lys 


Gin 


Asn 


505 








He 


He 


Ala 


Ala 




Leu 


Thr 


Leu 








540 


Leu 


Trp 


Val 


Phe 






555 




He 


He 


Gin 


Gly 




570 






Met 


Ala 


Gin 


Arg 


585 








Ala 


Val 


Gin 


Ser 


Ser 


Leu 


Asn 


Ser 








620 


Arg 


Asp 


Asp 


Arg 






635 




Asp 


Lys 


Val 


Pro 




650 






Thr 


He 


Asp 


Thr 


665 








Ala 


Thr 


Ser 


Leu 


Lys 


Leu 


Gin 


Gin 








700 


Asp 


Lys 


Gly 


Leu 






715 




Arg 


Leu 


Gly 


He 




730 






Phe 


Gly 


Gin 


Arg 


745 








Arg 


Val 


Val 


Leu 


Leu 


ASD 


Thr 


He 








780 


Ser 


Ser 


He 


Ala 






795 




His 


Leu 


Asp 


Gin 




810 






Asn 


Tyr 


Ser 


Leu 


825 








Thr 


Leu 


Asn 


Leu 


Leu 


Ala 


Phe 


Gin 
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Thr 


Leu 


Met 


Ala 








400 


He 


Val 


Val 


He 






415 




Pro 


Leu 


Ala 


Ala 




430 






He 


Ser 


Leu 


Thr 


445 








Met 


Gly 


Asp 


He 


Ala 


Val 


Ala 


He 








480 


Met 


Met 


Cys 


Ala 






495 




Arg 


Phe 


Ser 


Arg 




510 






Tyr 


Gly 


Arg 


Gly 


525 








Ser 


Val 


Ala 


Leu 


He 


Pro 


Lys 


Glv 








560 


Thr 


Leu 


Gin 


Ala 






575 




Gin 


Arg 


Gin 


Val 




590 






Leu 


Thr 


Ser 


Phe 


605 








Ala 


Arg 


Leu 


Gin 


Val 


Gin 


LyS 


Val 








640 


Gly 


Val 


Asp 


Leu 






655 




Gin 


Val 


Ser 


Arg 




670 






Asp 


Ala 


Leu 


Ser 


685 








Leu 


Pro 


Gin 


Leu 


Val 


Ala 


Tvr 
j 


Val 








720 


Ser 


Met 


Ala 


Asp 






735 




Leu 


He 


Ser 


Thr 




750 






Glu 


His 


Asn 


Thr 


765 








Arg 


Leu 


Thr 


Ser 


Lys 


He 


Glu 


Gin 








800 


Phe 


Pro 


Val 


Thr 






815 




Gly 


Asp 


Ala 


Val 




830 






Pro 


Val 


Asp 


He 


845 








Ser 


Ala 


Leu 


Gly 
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850 855 860 



Ser Thr 


vax 


Trp 


Leu He 


VoX 


A! a 


Ala 


Val 


Val 


Ala 


Met 


Tvr 


He 


Val 


Of c 




fi"7fl 

o / u 










875 










880 


T All «* 

iieu kiy 


He 


Leu 


i yr vjiu 


Cat- 


DKp 

t Jlc 


He 


His 


Pro 


Tie* 


Thr 


He 


Leu 


Ser 






op c 








890 










895 




Thr Leu 


Pro 


i nr 

you 




Vol 


uxy 


Ala 
905 


Leu 


Leu 


Ala 


Leu 


Leu 
910 


He 


Ala 


Gly Ser 


blU 


lieu 


Asp vai 


lie 


/-VJL d 


He 


He Gly 


lie 


lie 


Leu 


Leu 


He 


915 








y^u 


















Gly He 


val 


Lys 


T iff* T\ ^ 

juys fisn 


nld 


lie 


Met 


Met 


He 


B en 


Php 
rue 


Ala 


Leu 


Ala 


930 








QIC 




















Ala Glu 


Arg 


Glu 


bin vaiy 


wet 


Ser 


Pro Arg 


Glu 


Ala 
nld 


He 


Tyr Gin Ala 


945 




you 










955 










960 


Cys Leu 


Leu 


Arg 


Phe Arg 


Pro 


He 


Leu 


Met 


Thr 


Thr 


Leu 


Ala 


Ala 


Leu 






965 








970 










975 




Leu Gly 


Ala 


Leu 


Pro Leu 


Met 


Leu 


Ser 


Thr 


Gly 


Val 


Gly 


Ala 


Glu 


Leu 




980 








985 










990 






Arg Arg 


Pro 


Leu 


Gly He 


Gly 


Met 


Val 


Gly Gly 


Leu 


He 


Val 


Ser 


Gin 


995 








1000 








1005 






Val Leu 


Thr 


Leu 


Phe Thr 


Thr 


Pro 


Val 


He 


Tyr 


Leu 


Leu 


Phe Asp Arg 


1010 






1015 








1020 








Leu Ala 


Leu Trp 


Thr Lys 


Ser Arg 


Phe 


Ala 


Arg 


His 


Glu 


Glu 


Glu 


Ala 



1025 1030 1035 1040 

<210> 443 
<211> 1025 
<212> PRT 

<213> Escherichia coli 



<400> 443 



Met 


Lys 


Phe 


Phe 


Ala 


Leu 


Phe 


He 


Tyr Arg 


Pro 


Val Ala 


Thr 


He 


Leu 


1 






5 








10 








15 




Leu 


Ser 


Val 


Ala 


He 


Thr 


Leu Cys 


Gly He 


Leu 


Gly Phe 


Arg Met 


Leu 








20 










25 






30 






Pro 


Val 


Ala 


Pro 


Leu 


Pro 


Gin 


Val 


Asp Phe 


Pro 


Val He 


lie 


Val 


Ser 






35 










40 






45 








Ala 


Ser 


Leu 


Pro 


Gly 


Ala 


Ser 


Pro 


Glu Thr 


Met 


Ala Ser 


Ser 


Val 


Ala 




50 










55 








60 








Thr 


Pro 


Leu 


Glu 


Arg 


Ser 


Leu 


Gly 


Arg He 


Ala 


Gly Val 


Ser 


Glu 


Met 


65 










70 








75 








80 


Thr 


Ser 


Ser 


Ser 


Ser 


Leu 


Gly 


Ser 


Thr Arg 


He 


He Leu 


Gin 


Phe Asp 










85 








90 








95 




Phe 


Asp 


Arg 


Asp 


He 


Asn 


Gly Ala 


Ala Arg Asp 


Val Gin 


Ala 


Ala 


He 








100 










105 






110 






Asn 


Ala 


Ala 


Gin 


Ser 


Leu 


Leu 


Pro 


Ser Gly Met 


Pro Ser 


Arg 


Pro 


Thr 






115 










120 






125 








Tyr 


Arg 


Lys 


Ala 


Asn 


Pro 


Ser Asp 


Ala Pro 


He 


Met He 


Leu 


Thr 


Leu 


130 










135 








140 








Thr 


Ser 


Asp 


Thr 


Tyr 


Ser 


Gin 


Gly 


Glu Leu 


Tyr 


Asp Phe 


Ala 


Ser 


Thr 


145 










150 








155 








160 


Gin 


Leu 


Ala 


Pro 


Thr 


He 


Ser 


Gin 


He Asp Gly 


Val Gly 


Asp 


Val 


Asp 










165 








170 








175 




Val 


Gly 


Gly 


Ser 


Ser 


Leu 


Pro 


Ala 


Val Arg Val 


Gly Leu 


Asn 


Pro 


Gin 






180 










185 






190 






Ala 


Leu 


Phe 


Asn 


Gin 


Gly 


Val 


Ser 


Leu Asp Asp 


Val Arg 


Thr 


Ala 


Val 






195 










200 






205 








Ser 


Asn 


Ala 


Asn 


Val 


Arg 


Lys 


Pro 


Gin Gly Ala 


Leu Glu 


Asp Gly Thr 




210 










215 








220 








His 


Arg 


Trp 


Gin 


He 


Gin 


Thr 


Asn 


Asp Glu 


Leu 


Lys Thr 


Ala 


Ala 


Glu 


225 










230 








235 








240 
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Tyr Gin 


Pro 


Leu 


He 


He 


His 


Tyr 


Asn 


Asn 


Gly 


Gly 


Ala 


Val 


Arg 


Leu 








245 










250 










255 




Gly Asp 


Val 


Ala 


Thr 


Val 


Thr 


Asp 


Ser 


Val 


Gin 


Asp 


Val 


Arg 


Asn 


Ala 






260 










265 










270 






Gly Met 


Thr 


Asn 


Ala 


Lys 


Pro 


Ala 


He 


Leu 


Leu 


Met 


He 


Arg 


Lys 


Leu 




275 










280 










285 








Pro Glu 


Ala 


Asn 


He 


He 


Gin 


Thr 


Val 


Asp 


Ser 


He 


Arg 


Ala 


Lys 


Leu 


290 










295 










300 










Pro Glu 


Leu 


Gin 


Glu 


Thr 


He 


Pro 


Ala 


Ala 


He 


Asp 


Leu 


Gin 


He 


Ala 


305 








310 










315 










320 


Gin Asp 


Arg 


Ser 


Pro 


Thr 


He 


Arg 


Ala 


Ser 


Leu 


Glu 


Glu 
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15 




He 


Asn 


Glu 


Thr 


Leu 


Pro 


Ala 


Asn 


Gly 


He 


Thr 


Ala 


He 


Phe 


Gly 


Val 








20 










25 










30 






Ser 


Gly 


Ala 


Gly 


Lys 


Thr 


Ser 


Leu 


He 


Asn 


Ala 


He 


Ser 


Gly 


Leu 


Thr 






35 










40 










45 








Arg 


Pro 


Gin 


Lys 


Gly 


Arg 


He 


Val 


Leu 


Asn 


Gly 


Arg 


Val 


Leu 


Asn 


Asp 




50 










55 










60 










Ala 


Glu 


Lys 


Gly 


He 


Cys 


Leu 


Thr 


Pro 


Glu 


Lys 


Arg 


Arg 


Val 


Gly 


Tyr 


65 










70 










75 










80 


Val 


Phe 


Gin 


Asp 


Ala 


Arg 


Leu 


Phe 


Pro 


His 


Tyr 


Lys 


Val 


Arg 


Gly 


Asn 










85 










90 










95 




Leu 


Arg 


Tyr 


Gly 


Met 


Ser 


Lys 


Ser 


Met 


Val 


Asp 


Gin 


Phe 


Asp 


Lys 


Leu 








100 










105 










110 






Val 


Ala 


Leu 


Leu 


Gly 


He 


Glu 


Pro 


Leu 


Leu 


Asp 


Arg 


Leu 


Pro 


Gly 


Ser 






115 










120 










125 








Leu 


Ser 


Gly 


Gly 


Glu 


Lys 


Gin 


Arg 


Val 


Ala 


He 


Gly 


Arg 


Ala 


Leu 


Leu 




130 










135 










140 










Thr 


Ala 


Pro 


Glu 


Leu 


Leu 


Leu 


Leu 


Asp 


Glu 


Pro 


Leu 


Ala 


Ser 


Leu 


Asp 


145 










150 










155 










160 


He 


Pro 


Arg 


Lys 


Arg 


Glu 


Leu 


Leu 


Pro 


Tyr 


Leu 


Gin 


Arg 


Leu 


Thr 


Arg 










165 










170 










175 




Glu 


He 


Asn 


He 


Pro 


Met 


Leu 


Tyr 


Val 


Ser 


His 


Ser 


Leu 


Asp 


Glu 


He 
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180 






Leu 


His 


Leu 


Ala 


Asp 


Arg Val Met 






195 






200 


Ala 


Phe 


Gly 


Ala 


Leu 


Glu Glu Val 




210 








215 


Trp 


Leu 


Pro 


Lys 


Glu 


Gin Gin Ser 


225 










230 


Glu 


His 


His 


Pro 


His 


Tyr Ala Met 










245 




His 


Leu 


Trp 


Val 


Asn 


Lys Leu Asp 








260 




* 


He 


Arg 


lie 


Gin 


Ala 


Ser Asp Val 






275 






280 


Gin 


Thr 


Ser 


He 


Arg 


Asn Val Leu 




290 








295 


Asp 


Asp 


Asn 


Gly 


Gin 


Val Glu Val 


305 










310 


Leu 


Trp 


Ala 


Arg 


He 


Ser Pro Trp 










325 




Pro 


Gly 


Leu 


Trp 


Leu 


Tyr Ala Gin 








340 







<210> 449 
<211> 168 
<212> PRT 

<213> Escherichia coli 

<400> 449 
Met Asn Ser Val He Thr Gin Lys 

1 5 
Ala Asp Thr Lys Thr Gly Gly Phe 
20 

He Asp He Ser Asp Ser Glu Leu 

35 40 
Glu Glu Glu Ala Lys He Asp Asp 

50 55 
He Pro Ser Leu Pro Tyr Tyr Ala 
65 70 
Glu Leu Pro Ala Met Asp Asp Leu 
85 

Glu Glu He He Lys Lys Phe Lys 
100 

His Val Glu Glu Gly Ser Pro Lys 
115 120 
Lys He Pro Ala His Met He He 

130 135 
Thr Thr Tyr Leu Leu Gly Ser Asn 
145 150 
Glu Cys Ser Val Leu Val Val Arg 
165 

<210> 450 
<211> 377 
<212> PRT 

<213> Escherichia coli 
<400> 450 

Met Lys Ser Lys Val Leu Ala Leu 

1 5 
Gly Ala Ala His Ala Ala Glu Val 



185 190 
Val Leu Glu Asn Gly Gin Val Lys 
205 

Trp Gly Ser Ser Val Met Asn Pro 
220 

Ser lie Leu Lys Val Thr Val Leu 
235 240 
Thr Ala Leu Ala Leu Gly Asp Gin 

250 255 
Glu Pro Leu Gin Ala Ala Leu Arg 
265 270 
Ser Leu Val Leu Gin Pro Pro Gin 
285 

Arg Ala Lys Val Val Asn Ser Tyr 
300 

Glu Leu Glu Val Gly Gly Lys Thr 
315 320 
Ala Arg Asp Glu Leu Ala He Lys 

330 335 
He Lys Ser Val Ser He Thr Ala 
345 350 



Val Ser Ser Gly Val Thr Leu Tyr 

10 " 15 

Met Asn Arg Thr He Leu Val Pro 
25 30 
Thr Gin Arg Val He Ser His Val 
45 

Ala Glu Val His Phe Leu Thr Val 
60 

Ser Leu Gly Leu Ala Tyr Ser Ala 

75 80 
Lys Ala Glu Ala Lys Ser Gin Leu 

90 95 
Leu Pro Thr Asp Arg Val His Val 
105 110 
Asp Arg He Leu Glu Leu Ala Lys 
125 

He Ala Ser His Arg Pro Asp He 
140 

Ala Ala Ala Val Val Arg His Ala 
155 160 



Leu He Pro Ala Leu Leu Ala Ala 

10 15 
Tyr Asn Lys Asp Gly Asn Lys Leu 
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20 25 30 



Asp 


Leu 


Tyr 


Gly Lys Val Asp 


Gly 


Leu 


His 


Tyr 


Phe 


Ser 


Asp Asn 


Ser 






35 




40 










45 






Ala 


Lys 


Asp 


Gly Asp Gin Ser 


Tyr 


Ala 


Arg 


Leu 


Gly 


Phe 


Lys Gly 


Glu 




50 




55 










60 








Thr 


Gin 


He 


Asn Asp Gin Leu 


Thr 


Gly 


Tyr 


Gly 


Gin 


Trp 


Glu Tyr 


Asn 


65 






70 








75 








80 


He 


Gin 


Ala 


Asn Asn Thr Glu 


Ser 


Ser 


Lys 


Asn 


Gin 


Ser 


Trp Thr 


Arg 








85 






90 








95 




Leu 


Ala 


Phe 


Ala Gly Leu Lys 


Phe 


Ala 


Asp 


Tyr 


Gly 


Ser 


Phe Asp 


Tyr 








100 




105 










110 




Gly 


Arg 


Asn 


Tyr Gly Val Met 


Tyr 


Asp 


He 


Glu 


Gly 


Trp 


Thr Asp 


Met 






115 




120 










125 






Leu 


Pro 


Glu 


Phe Gly Gly Asp 


Ser 


Tyr 


Thr 


Asn 


Ala 


Asp 


Asn Phe 


Met 




130 




135 










140 








Thr 


Gly 


Arg 


Ala Asn Gly Val 


Ala 


Thr 


Tyr 


Arg 


Asn 


Thr 


Asp Phe 


Phe 


145 






150 








155 








160 


Gly 


Leu 


Val 


Asn Gly Leu Asn 


Phe 


Ala 


Val 


Gin 


Tyr 


Gin 


Gly Asn 


Asn 






165 






170 








175 




Glu 


Gly 


Ala 


Ser Asn Gly Gin 


Glu 


Gly 


Thr 


Asn 


Asn 


Gly 


Arg Asp 


Val 








180 




185 










190 




Arg 


His 


Glu 


Asn Gly Asp Gly 


Trp 


Gly 


Leu 


Ser 


Thr 


Thr 


Tyr Asp 


Leu 






195 




200 










205 






Gly 


Met 


Gly 


Phe Ser Ala Gly 


Ala 


Ala 


Tyr 


Thr 


Ser 


Ser 


Asp Arg 


Thr 


210 




215 










220 








Asn 


Asp 


Gin 


Val Asn His Thr 


Ala 


Ala 


Gly 


Gly 


Asp 


Lys 


Ala Asp 


Ala 


225 






230 








235 








240 


Trp 


Thr 


Ala 


Gly Leu Lys Tyr 


Asp 


Ala 


Asn 


Asn 


He 


Tyr 


Leu Ala 


Thr 






245 






250 








255 




Met 


Tyr 


Ser 


Glu Thr Arg Asn 


Met 


Thr 


Pro 


Phe 


Gly 


Asp 


Ser Asp 


Tyr 






260 




265 










270 




Ala 


Val 


Ala 


Asn Lys Thr Gin 


Asn 


Phe 


Glu 


Val 


Thr 


Ala 


Gin Tyr 


Gin 






275 




280 










285 






Phe 


Asp 


Phe 


Gly Leu Arg Pro 


Ala 


Val 


Ser 


Phe 


Leu 


Met 


Ser Lys 


Gly 




290 




295 










300 








Arg 


Asp 


Leu 


His Ala Ala Gly 


Gly 


Ala 


Asp 


Asn 


Pro 


Ala 


Gly Val 


Asp 


305 






310 








315 








320 


Asp 


Lys 


Asp 


Leu Val Lys Tyr 


Ala 


Asp 


lie 


Gly 


Ala 


Thr 


Tyr Tyr 


Phe 








325 






330 








335 




Asn 


Lys 


Asn 


Met Ser Thr Tyr 


Val 


Asp 


Tyr 


Lys 


He 


Asn 


Leu Leu 


Asp 








340 




345 










350 




Glu 


Asp 


Asp 


Ser Phe Tyr Ala 


Ala 


Asn 


Gly 


He 


Ser 


Thr 


Asp Asp 


He 






355 




360 










365 






Val 


Ala 


Leu 


Gly Leu Val Tyr 


Gin 


Phe 
















370 




375 


















<210> 451 




















<211> 1122 




















<212> PRT 




















<213> Escherichia coli 


















<400> 451 




















Met Leu Arg Val Tyr His Ser 


Asn 


Arg 


Leu 


Asp 


Val 


Leu 


Glu Ala 


Leu 


1 






5 






10 








15 




Met 


Glu 


Phe 


He Val Glu Arg 


Glu 


Arg 


Leu 


Asp 


Asp 


Pro 


Phe Glu 


Pro 








20 




25 










30 




Glu 


Met 


He 


Leu Val Gin Ser 


Thr 


Gly 


Met 


Ala 


Gin 


Trp 


Leu Gin 


Met 






35 




40 










45 






Thr Leu Ser Gin Lys Phe Gly 


He 


Ala 


Ala 


Asn 


He 


Asp 


Phe Pro 


Leu 




50 




55 










60 
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rj __ _ 

Pro 


Ala 


Ser 


raw a 

pne 


lie 


Trp 


Cc 

Dj 










f u 


Pro 


Lys 


Glu 


ser 


Ala 


pne 










q t 

OD 




Thr 


Leu 


Leu 


Pro 


f 1 M 

Gin 


Leu 








1UU 






His 


Tyr 


Leu 


Thr 


Asp 


Asp 






115 








Ser 


Lys 


Ala 


Ala 


Asp 


Leu 




130 










Trp 


Leu 


Ala 


Gin 


Trp 


GlU 


145 










1DU 


Ala 


Gin 


Ala 


Trp 


Gin 


TV 1 -s 

Ala 










165 




His 


Gin 


Leu 


Gly 


Gin 


Pro 








ion 

180 






Phe 


lie 


Glu 


Thr 


Leu 


Glu 






195 








Ser 


Arg 


Val 


Phe 


lie 


Cys 




210 










Gin 


Ala 


Leu 


Gin 


TV 1 -» 

Ala 


Leu 


Zzb 










Z JU 


Thr 


Asn 


Pro 


Cys 


Arg 


Tyr 










O A^ 
Z ft O 




Leu 


Ala 


Lys 


Leu 


Leu 


Tnr 








260 






Glu 


Leu 


Pro 


Leu 


Pne 


Arg 






275 








Ser 


Asp 


Gly 


Glu 


Gin 


T» — .— 

Asp 




290 










Lys 


Leu 


Gly 


Arg 


Asp 


Tyr 


305 










Tin 

310 


Gin 


Glu 


Leu 


Asp 


71 "1 — 

Ala 


pne 










Jzb 




Asn 


lie 


Gin 


Ser 


Asp 


lie 








o4 U 






vai 


Asn 


T 1 ^ 

lie 


blU 


blU 


pne 






o a c 
JJJ 








Pro 


Leu 


Asp 


Ser 


Ser 


lie 




IT ft 
O / U 










Glu 


Val 


Glu 


Val 


Leu 


His 


385 










O A 

390 


Pro 


Thr 


Leu 


Thr 


Pro 


Arg 










405 




Ser 


Tyr 


Ser 


Pro 


Phe 


Ti- 
ne 








/on 
4zU 






Arg 


Tyr 


Leu 


Pro 


Tyr 


/via 






A Q C 








Pro 


vai 


Leu 


blU 


Ala 


pne 




A CA 

4bU 










Phe 


vai 


Ser 


GlU 


Asp 


vai 


A £R 
HOD 










din 


Ala 


Arg 


Phe 


Asp 


He 


Thr 










485 




Val 


Asn 


Glu 


Ser 


Gly 


He 








500 






Glu 


Leu 


Glu 


Leu 


Pro 


Ala 






515 








Thr 


Arg 


Met 


Leu 


Leu 


Gly 



530 



Asp 


«et 


pne 


vai 


TV %~n 

Arg 


vai 










l 3 




Asn 


Lys 


bin 


ber 


wet 


ser 














Leu 


blU 


Arg 


blU 


Asp 


pne 






1UD 








ber 


Asp 


Lys 


Arg 


Lys 


Leu 














Pne 


Asp 


bin 


Tyr 


Leu 


Val 


1 oc 

lOD 










14U 


Tnr 


Gly 


Mi- 
nis 


Leu 


val 


GlU 










li>J 




Pro 


Leu 


Trp 


Lys 


7\ T _ 

Ala 


Leu 








1 f\j 






Arg 


Trp 


HlS 


7\ _ 

Arg 


TV 1 — 

Ala 


Asn 






i pc; 

10 J 








Ser 


Ala 


xnr 


rp l_ 

Tnr 


Cys 


Pro 














Gly 


lie 


Ser 


Ala 


Leu 


Pro 












ion 


biy 


Lys 


HIS 


T 1 _ 

lie 


blU 


Ti- 
ne 




. .. ■ 




■ 


*? ^ R 




Tyr 


Trp 


biy 


Asp 


Ti- 
ne 


Lys 














Arg 


bin 


Arg 


Arg 


nlS 


Ser 






£OD 








Asp 


Ser 


blU 


TV 

Asn 


Ala 


biy 




ZoU 










vai 


biy 


TV — _ 

Asn 


Pro 


Leu 


Leu 


Z !7D 










JVJU 


T 1 

lie 


Tyr 


Leu 


Leu 


Ser 


Asp 










Jl 3 




val 


ASp 


val 


i nr 


Pro 


Asp 








jjU 






Leu 


Pin 

blU 


Leu 


blU 


Asn 


Arg 






o4 □ 








C n v 

ber 


Arg 


ber 


Asp 


Asn 


Lys 




JbU 










Thr 


Pne 


His 


Val 


Cy3 


His 


O / D 












Asp 


Arg 


Leu 


T ins 

Leu 


TV 1 -» 

Aia 


Met 










395 




Asp 


lie 


Ti- 
ne 


XT — 1 

vai 


Met 


t 7 _ 1 

val 








4 10 






Gin 


71 1 _ 

Ala 


val 


pne 


biy 


O -w 

ber 






A 95 








lie 


ber 


Asp 


Arg 


Arg 


Aia 




4 4 0 










lie 


Cat* 

ber 


Leu 


Leu 


Ser 


Leu 


A G. Q 

400 










4 60 


Leu 


Aia 


Leu 


Leu 


Asp 


vai 














Glu 


blU 


biy 


Leu 


Arg 


Tyr 








490 






Arg 


Trp 


Gly 


lie 


Asp 


Asp 






505 








Thr 


Gly 


Gin 


His 


Thr 


Trp 




520 










Tyr 


Ala 


Met 


Glu 


Ser 


Ala 


535 










540 
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Leu 


Pro 


blU 


Ti- 
ne 








80 


Trp 


Lys 


Leu 


Met 






yj 




Tnr 


Leu 


Leu 


TV _ _ 

Arg 




1 1 n 

11U 






nL — 

pne 


pi - 
Gin 


Leu 


Ser 


lc J 








Tyr 


Arg 


Pro 


TV »m 

ASp 


biy 


Leu 


biy 


blU 








160 


Val 


Glu 


Tyr 


Thr 






lie 

1 / b 




Tax 

Leu 


Tyr 


Gin 


Arg 




i yu 






Pro 


Gly 


Leu 


Pro 


Z Uj 








Pro 


Val 


Tyr 


Leu 


His 


Leu 


Leu 


Pne 








240 


Asp 


Pro 


Ala 


Tyr 






zbb 




pne 


blU 


ASp 


Arg 




o "7 n 
Z / U 






bin 


Leu 


nu — 

pne 


TV 

Asn 


ZOO 








Aia 


ber 


Trp 


biy 


Leu 


blU 


Ser 


ber 








320 


Asn 


Leu 


Leu 


His 










ni a 

Aia 


w a 1 

vai 


Aia 


biy 




JJU 






Arg 


Pro 


Leu 


Asp 


ODD 








Ser 


Pro 


bin 


Arg 


Leu 


pi 

GlU 


GlU 


Asp 








400 


Ala 


Asp 


He 


Asp 






41b 




Aia 


Pro 


Aia 


Asp 




4 "30 
ft OvJ 






Arg 


bin 


ber 


nlS 


4 A S 








Pro 


Asp 


ber 


Arg 


Pro 


vai 


Leu 


Aia 








480 


Leu 


Arg 


Gin 


Trp 






495 




Asp 


Asn 


Val 


Arg 




510 






Arg 


Phe 


Gly 


Leu 


525 








Gin 


Gly 


Glu 


Trp 
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Gin 


Ser 


Val 


Leu 


Pro 


Tyr 


Asp 


Glu 


Ser 


Ser 


Gly 


Leu 


He 


Ala 


Glu 


Leu 


545 










550 










555 










560 


Val 


Gly 


His 


Leu 


Ala 


Ser 


Leu 


Leu 


Met 


Gin 


Leu 


Asn 


He 


Trp 


Arg 


Arg 










565 










570 










575 




Gly 


Leu 


Ala 


Gin 


Glu 


Arg 


Pro 


Leu 


Glu 


Glu 


Trp 


Leu 


Pro 


Val 


Cys 


Arg 








580 










585 










590 






Asp 


Met 


Leu 


Asn 


Ala 


Phe 


Phe 


Leu 


Pro 


Asp 


Ala 


Glu 


Thr 


Glu 


Ala 


Ala 






595 










600 










605 








Met 


Thr 


Leu 


He 


Glu 


Gin 


Gin 


Trp 


Gin 


Ala 


He 


lie 


Ala 


Glu 


Gly 


Leu 




610 










615 










620 










Gly 


Ala 


Gin 


Tyr 


Gly 


Asp 


Ala 


Val 


Pro 


Leu 


Ser 


Leu 


Leu 


Arg 


Asp 


Glu 


625 










630 










635 










640 


Leu 


Ala 


Gin 


Arg 


Leu 


Asp 


Gin 


Glu 


Arg 


He 


Ser 


Gin 


Arg 


Phe 


Leu 


Ala 










645 










650 










655 




Gly 


Pro 


Val 


Asn 


He 


Cys 


Thr 


Leu 


Met 


Pro 


Met 


Arg 


Ser 


He 


Pro 


Phe 








660 










665 










670 






Lys 


Val 


Val 


Cys 


Leu 


Leu 


Gly 


Met 


Asn 


Asp 


Gly 


Val 


Tyr 


Pro 


Arg 


Gin 






675 










680 










685 








Leu 


Ala 


Pro 


Leu 


Gly 


Phe 


Asp 


Leu 


Met 


Ser 


Gin 


Lys 


Pro 


Lys 


Arg 


Gly 




690 










695 










700 










Asp 


Arg 


Ser 


Arg 


Arg 


Asp 


Asp 


Asp 


Arg 


Tyr 


Leu 


Phe 


Leu 


Glu 


Ala 


Leu 


705 










710 










715 










720 


lie 


Ser 


Ala 


Gin 


Gin 


Lys 


Leu 


Tyr 


He 


Ser 


Tyr 


He 


Gly 


Arg 


Ser 


He 










725 










730 










735 




Gin 


Asp 


Asn 


Ser 


Glu 


Arg 


Phe 


Pro 


Ser 


Val 


Leu 


Val 


Gin 


Glu 


Leu 


He 








740 










745 










750 






Asp 


Tyr 


He 


Gly 


Gin 


Ser 


His 


Tyr 


Leu 


Pro 


Gly 


Asp 


Glu 


Ala 


Leu 


Asn 






755 










760 










765 








Cys 


Asp 


Glu 


Ser 


Glu 


Ala 


Arg 


Val 


Lys 


Ala 


His 


Leu 


Thr 


Cys 


Leu 


His 




770 










775 










780 










Thr 


Arg 


Met 


Pro 


Phe 


Asp 


Pro 


Gin 


Asn 


Tyr 


Gin 


Pro 


Gly 


Glu 


Arg 


Gin 


785 










790 










795 










800 


Ser 


Tyr 


Ala 


Arg 


Glu 


Trp 


Leu 


Pro 


Ala 


Ala 


Ser 


Gin 


Ala 


Gly 


Lys 


Ala 










805 










810 










815 




His 


Ser 


Glu 


Phe 


Val 


Gin 


Pro 


Leu 


Pro 


Phe 


Thr 


Leu 


Pro 


Glu 


Thr 


Val 








820 










825 










830 






Pro 


Leu 


Glu 


Thr 


Leu 


Gin 


Arg 


Phe 


Trp 


Ala 


His 


Pro 


Val 


Arg 


Ala 


Phe 






835 










840 










845 








Phe 


Gin 


Met 


Arg 


Leu 


Gin 


Val 


Asn 


Phe 


Arg 


Thr 


Glu 


Asp 


Ser 


Glu 


He 




850 










855 










860 










Pro 


Asp 


Thr 


Glu 


Pro 


Phe 


He 


Leu 


Glu 


Gly 


Leu 


Ser 


Arg 


Tyr 


Gin 


He 


865 










870 










875 










880 


Asn 


Gin 


Gin 


Leu 


Leu 


Asn 


Ala 


Leu 


Val 


Glu 


Gin 


Asp 


Asp 


Ala 


Glu 


Arg 










885 










890 










895 




Leu 


Phe 


Arg 


Arg 


Phe 


Arg 


Ala 


Ala 


Gly 


Asp 


Leu 


Pro 


Tyr 


Gly 


Ala 


Phe 








900 










905 










910 






Gly 


Glu 


He 


Phe 


Trp 


Glu 


Thr 


Gin 


Cys 


Gin 


Glu 


Met 


Gin 


Gin 


Leu 


Ala 






915 










920 










925 








Asp 


Arg 


Val 


He 


Ala 


Cys 


Arg 


Gin 


Pro 


Gly 


Gin 


Ser 


Met 


Glu 


He 


Asp 




930 










935 










940 










Leu 


Ala 


Cys 


Asn 


Gly 


Val 


Gin 


He 


Thr 


Gly 


Trp 


Leu 


Pro 


Gin 


Val 


Gin 


945 










950 










955 










960 


Pro 


Asp 


Gly 


Leu 


Leu 


Arg 


Trp 


Arg 


Pro 


Ser 


Leu 


Leu 


Ser 


Val 


Ala 


Gin 










965 










970 










975 




Gly 


Met 


Gin 


Leu 


Trp 


Leu 


Glu 


His 


Leu 


Val 


Tyr 


Cys 


Ala 


Ser 


Gly 


Gly 








980 










985 










990 






Asn 


Gly 


Glu 


Ser 


Arg 


Leu 


Phe 
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1090 1095 1100 

lie Val Glu Gin Ser Gin Arg Phe Leu Leu Pro Leu Phe Arg Phe Asn 
1105 1110 1115 1120 

Gin Ser 



<210> 452 
<211> 107 
<212> PRT 

<213> Escherichia coli 
<400> 452 



Met 


Ser 


Ala 


Ser 


Leu 


Lys 


Asn Gin Gin Gly Phe Ser Leu Pro Glu Val 


1 








5 




.10 15 


Met 


Leu 


Ala 


Met 
20 


Val 


Leu 


Met Val Met He Val Thr Ala Leu Ser Gly 
25 30 


Phe 


Gin 


Arg 
35 


Thr 


Leu 


Met 


Asn Ser Leu Ala Ser Arg Asn Gin Tyr Gin 
40 45 


Gin 


Leu 
50 


Trp 


Arg 


His 


Gly 


Trp Gin Gin Thr Gin Leu Arg Ala He Ser 
55 60 


Pro 


Pro 


Ala 


Asn 


Trp 


Gin 


Val Asn Arg Met Gin Thr Ser Gin Ala Gly 


65 










70 


75 80 


Cys 


Val 


Ser 


He 


Ser 


Val 


Thr Leu Val Ser Pro Gly Gly Arg Glu Gly 








85 




• 90 95 


Glu 


Met 


Thr 


Arg 
100 


Leu 


His 


Cys Pro Asn Arg Gin 
105 



<210> 453 
<211> 121 
<212> PRT 

<213> Escherichia coli 
<400> 453 



Met 


Leu 


Leu 


Val 


Leu 


Gly 


Ser Leu Leu Leu Gin Gly Met Ser 


Gin Gin 


1 








5 




10 


15 


Asp 


Arg 


Ser 


Phe 


Ala 


Ser 


Arg Val Ser Met Glu Ser Gin Ser 


Leu Arg 




20 






25 30 




Arg 


Gin 


Ala 


He 


Val 


Gin 


Ser Ala Leu Ala Trp Gly Lys Met 


His Cys 




35 








40 45 




Trp 


Gin 


Thr 


Gin 


Pro 


Ala 


Val Gin Cys Ser Gin Tyr Ala Glu 


Thr Asp 


50 










55 60 




Ala 


Gin 


Val 


Cys 


Leu 


Arg 


Leu Leu Ala Asp Asn Glu Ala Leu 


Leu He 


65 








70 


75 


80 


Ala 


Gly 


Tyr 


Glu 


Gly 


Val 


Ser Leu Trp Arg Thr Gly Glu Val 


He Asp 






85 




90 


95 


Gly 


Asn 


He 


Val 


Phe 


Ser 


Pro Arg Gly Trp Ser Asp Phe Cys 


Pro Leu 






100 






105 HO 




Lys 


Glu 


Arg 


Ala 


Leu 


Cys 


Gin Leu Pro 






115 








120 





<210> 454 



-407- 



WO 01/34810 



PCT7USO0/3095O 



<211> 187 
<212> PRT 

<213> Escherichia coli 



<400> 454 



Met 


Pro 


Val 


Lys 


Glu 


Gin Gly 


Phe 


Ser 


Leu 


Leu 


Glu 


Val 


Leu 


He 


Ala 


1 








5 










10 










15 




Met 


Ala 


He 


Ser 


Ser 


Val 


Leu 


Leu 


Leu Gly 


Ala 


Ala 


Arg 


Phe 


Leu 


Pro 








20 










25 










30 






Ala 


Leu 


Gin 


Arg 


Glu 


Ser 


Leu 


Thr 


Ser 


Thr 


Arg 


Lys 


Leu 


Ala 


Leu 


Glu 






35 










40 










45 








Asp 


Glu 


He 


Trp 


Leu 


Arg 


Val 


Phe 


Thr 


Val 


Ala 


Lys 


His 


Leu 


Gin 


Arg 




50 










55 










60 










Ala 


Gly 


Tyr 


Cys 


His 


Gly 


He 


Cys 


Thr 


Gly 


Glu 


Gly 


Leu 


Glu 


He 


Val 


65 










70 










75 










80 


Gly 


Gin 


Gly 


Asp 


Cys 


Val 


lie 


Val 


Gin 


Trp 


Asp 


Ala 


Asn 


Ser 


Asn 


Gly 










85 










90 










95 




lie 


Trp 


Asp 


Arg 


Glu 


Pro 


Val 


Lys 


Glu 


Ser 


Asp 


Gin 


He 


Gly 


Phe 


Arg 








100 










105 










110 






Leu 


Lys 


Glu 


His 


Val 


Leu 


Glu 


Thr 


Leu Arg 


Gly 


Ala 


Thr 


Ser 


Cys 


Glu 






115 










120 










125 








Gly 


Lys 


Gly 


Trp 


Asp 


Lys 


Val 


Thr 


Asn 


Pro 


Asp 


Ala 


He 


He 


He 


Asp 




130 










135 










140 










Thr 


Phe 


Gin 


Val 


Val 


Arg Gin Asp 


Val 


Ser 


Gly 


Phe 


Ser 


Pro 


Val 


Leu 


145 










150 










155 










160 


Thr 


Val 


Asn 


Met 


Arg 


Ala 


Ala 


Ser 


Lys 


Ser 


Glu 


Pro 


Gin 


Thr 


Val 


Val 










165 










170 










175 




Asn 


Ala 


Ser 


Tyr 


Ser 


Val 


Thr Gly 


Phe 


Asn 


Leu 


















180 










185 
















<210> 455 




























<211> 156 




























<212> PRT 




























<213> Escherichia coli 






















<400> 455 




























Met 


Lys 


Thr 


Gin Arg Gly Tyr Thr 


Leu 


He 


Glu 


Thr 


Leu 


Val 


Ala 


Met 


1 








5 










10 










15 




Leu 


He 


Leu 


Val 


Met 


Leu 


Ser 


Ala 


Ser Gly 


Leu 


Tyr 


Gly 


Trp 


Gin 


Tyr 








20 










25 










30 






Trp 


Gin 


Gin 


Ser 


Gin Arg Leu Trp 


Gin 


Thr 


Ala 


Ser 


Gin 


Ala 


Arg 


Asp 






35 










40 










45 








Tyr 


Leu 


Leu 


Tyr 


Leu Arg Glu Asp 


Ala 


Asn 


Trp 


His 


Asn 


Arg 


Asp 


His 




50 










55 










60 










Ser 


He 


Ser 


Val 


He Arg Glu Gly 


Thr 


Leu 


Trp 


Cys 


Leu 


Val 


Ser 


Ser 


65 










70 










75 










80 


Ala 


Ala 


Gly 


Ala Asn Thr Cys 


His 


Gly Ser 


Ser 


Pro 


Leu 


Val 


Phe 


Val 










85 










90 










95 




Pro 


Arg 


Trp 


Pro 


Glu 


Val 


Glu 


Met 


Ser Asp 


Leu 


Thr 


Pro 


Ser 


Leu 


Ala 








100 










105 










110 






Phe 


Phe 


Gly 


Leu 


Arg 


Asn 


Thr 


Ala 


Trp 


Ala 


Gly 


His 


He 


Arg 


Phe 


Lys 






115 










120 










125 








Asn 


Ser 


Thr 


Gly Glu Trp Trp Leu 


Val 


Val 


Ser 


Pro 


Trp 


Gly 


Arg 


Leu 




130 










135 










140 










Arg 


Leu 


Cys 


Gin Gin Gly Glu Thr 


Glu Gly 


Cys 


Leu 










145 










150 










155 













<210> 456 
<211> 711 
<212> PRT 



-408- 



WO 01/34810 



PCT/US00/30950 



<213> Escherichia coli 



<400> 456 



Met 


Ser 


Thr 


lie 


Val 


He 


Phe 


Leu 


Ala 


Ala 


Leu 


Leu 


Ala 


Cys 


Ser 


Leu 


1 








5 










10 










15 




Leu 


Ala 


Gly 


Trp 


Leu 


He 


Lys 


Val 


Arg 


Ser 


Arg 


Arg 


Arg 


Gin 


Leu 


Pro 








20 










25 










30 






Trp 


Thr 


Asn 


Ala 


Phe 


Ala 


Asp 


Ala 


Gin 


Thr 


Arg 


Lys 


Leu 


Thr 


Pro 


Glu 






35 










40 










45 








Glu 


Arg 


Ser 


Ala 


Val 


Glu 


Asn 


Tyr 


Leu 


Glu 


Ser 


Leu 


Thr 


Gin 


Val 


Leu 




50 










55 










60 










Gin 


Val 


Pro 


Gly 


Pro 


Thr 


Gly 


Ala 


Ser 


Ala 


Ala 


Pro 


He 


Ser 


Leu 


Ala 


65 










70 










75 










80 


Leu 


Asn 


Ala 


Glu 


Ser 


Asn 


Asn 


Val 


Met 


Met 


Leu 


Thr 


His 


Ala 


He 


Thr 










85 










90 










95 




Arg 


Tyr 


Gly 


He 


Ser 


Thr 


Asp 


Asp 


Pro 


Asn 


Lys 


Trp 


Arg 


Tyr 


Tyr 


Leu 








100 










105 










110 






Asp 


Ser 


Val 


Glu 


Val 


His 


Leu 


Pro 


Pro 


Phe 


Trp 


Glu 


Gin 


Tyr 


He 


Asn 






115 










120 










125 








Asp 


Glu 


Asn 


Thr 


Val 


Glu 


Leu 


He 


His 


Thr 


Asp 


Ser 


Leu 


Pro 


Leu 


Val 


130 










135 










140 










lie 


Ser 


Leu 


Asn 


Gly 


His 


Thr 


Leu 


Gin 


Glu 


Tyr 


Met 


Gin 


Glu 


Thr 


Arg 


145 










150 










155 










160 


Ser 


Tyr 


Ala 


Leu 


Gin 


Pro 


Val 


Pro 


Ser 


Thr 


Gin 


Ala 


Ser 


He 


Arg 


Gly 








165 










170 










175 




Glu 


Glu 


Ser 


Glu 


Gin 


He 


GlU 


Leu 


Leu 


Asn 


Tl- 
IlS 


Arg 


Lys 


GlU 


Thr 


HIS 








180 










185 










190 






Glu 


Glu 


Tyr 


Ala 


Leu 


Ser 


Arg 


Pro 


Arg 


Gly 


Leu 


Arg 


Glu 


Ala 


Leu 


Leu 






195 










200 










205 








lie 


Val 


Ala 


Ser 


Phe 


Leu 


Met 


Phe 


Phe 


Phe 


Cys 


Leu 


lie 


Thr 


Pro 


Asp 




210 










215 










220 










Val 


Phe 


Val 


' Pro 


Trp 


Leu 


Ala 


Gly 


(nly 


7\ "1 _ 

Ala 


Leu 


Leu 


Leu 


T All 

Leu 


Gly 


Ala 


225 










230 










o o c 

235 










240 


Gly 


Leu 


Trp 


Gly 


Leu 


Phe 


Ala 


Pro 


Pro 


Ala 


Lys 


Ser 


Ser 


Leu 


Arg 


Glu 










245 










250 










255 




lie 


His 


Cys 


Leu 


Arg 


Gly 


Thr 


Pro 


Arg 


Arg 


Trp 


Gly 


Leu 


Phe 


Gly 


Glu 








260 










265 










270 






Asn 


Asp 


Gin 


Glu 


Gin 


He 


Asn 


Asn 


He 


Ser 


Leu 


Gly 


He 


He 


Asp 


Leu 






275 










280 










285 








Val 


Tyr 


Pro 


Ala 


His 


Trp 


Gin 


Pro 


Tyr 


He 


Ala 


Gin 


Asp 


Leu 


Gly 


Gin 




290 










295 










300 










Gin 


Thr 


Asp 


He 


Asp 


ne 


Tyr 


Leu 


Asp 


Arg 


His 


val 


Val 


Arg 


Gin 


Gly 


305 










310 










315 










■air* 


Arg 


Tyr 


Leu 


Ser 


Leu 


His 


Asp 


blU 


val 


Lys 


Asn 


Phe 


Pro 


Leu 


bin 


His 










325 










330 










T% n c 

335 




Trp 


Leu 


Arg 


Ser 


Thr 


He 


He 


Ala 


TV 1 — . 

Ala 


Gly 


Ser 


Leu 


Leu 


val 


Leu 


Phe 








340 










345 










T r f\ 

350 






Met 


Leu 


Leu 


Phe 


Trp 


He 


Pro 


Leu 


Asp 


Met 


Pro 


Leu 


Lys 


Phe 


Thr 


Leu 






355 










360 










365 








Cor 


Trp 




Lys 




nla 




Th-r 
1x1X7 


Tip 


\j -LU 


nla 


Thr 
1 1 IX 


Cor 


vox 


Lys 


fil n 

V7XJ1 




370 










375 










380 










Leu 


Ala 


Asp 


Ala 


Gly 


Val 


Arg 


Val 


Gly 


Asp 


Thr 


Leu 


Arg 


lie 


Ser 


Gly 


385 










390 










395 










400 


Thr 


Gly 


Met 


Cys 


Asn 


He 


Arg 


Thr 


Ser 


Gly 


Thr 


Trp 


Ser 


Ala 


Lys 


Thr 










405 










410 










415 




Asn 


Ser 


Pro 


Phe 


Leu 


Pro 


Phe 


Asp 


Cys 


Ser 


Gin 


lie 


He 


Trp 


Asn 


Asp 








420 










425 










430 






Ala 


Arg 


Ser 


Leu 


Pro 


Leu 


Pro 


Glu 


Ser 


Glu 


Leu 


Val 


Asn 


Lys 


Ala 


Thr 






435 










440 










445 








Ala 


Leu 


Thr 


Glu 


Ala 


Val 


Asn 


Arg 


Gin 


Leu 


His 


Pro 


Lys 


Pro 


Glu 


Asp 



-409- 



WO 01/34810 



PCT7US00/30950 



450 455 460 



Glu 


Ser 


Arg Val 


Ser 


Ala 


Ser 


Leu 


Arg 


Ser 


Ala 


He 


Gin 


Lys 


Ser 


Gly 


4 65 










470 










475 










480 


Met 


Val 


Leu 


Leu Asp 


Asp 


Phe 


Gly 


Asp 


He 


Val 


Leu 


Lys 


Thr 


Ala 


Asp 










485 










490 










495 




Leu 


Cys 


Ser 


Ala 


Lys 


Asp 


Asp 


Cys 


Val 


Arg 


Leu 


Lys 


Asn 


Ala 


Leu 


Val 








500 










505 










510 






Asn 


Leu 


Gly Asn Ser 


Lys 


Asp 


Trp 


Asp 


Ala 


Leu 


Val 


Lys 


Arg 


Ala 


Asn 






515 










520 










525 








Ala 


Gly 


Lys 


Leu Asp 


Gly 


Val 


Asn 


Val 


Leu 


Leu 


Arg 


Pro 


Val 


Ser 


Ala 




530 










535 










540 










Glu 


Ser 


Leu Asp Asn 


Leu 


Val 


Ala 


Thr 


Ser 


Thr 


Ala 


Pro 


Phe 


He 


Thr 


545 










550 










555 










560 


His 


Glu 


Thr Ala Arg 


Ala 


Ala 


Gin 


Ser 


Leu 


Asn 


Ser 


Pro 


Ala 


Pro 


Gly 










565 










570 










575 




Gly 


Phe 


Leu 


He 


Val 


Ser 


Asp 


Glu 


Gly 


Ser 


Asp 


Phe 


Val 


Asp 


Gin 


Pro 








580 










585 










590 






Trp 


Pro 


Ser 


Ala 


Ser 


Leu 


Tyr 


Asp 


Tyr 


Pro 


Pro 


Gin 


Glu 


Gin 


Trp 


Asn 






595 










600 










605 








Ala 


Phe 


Gin 


Lys 


Leu 


Ala 


Gin 


Met 


Leu 


Met 


His 


Thr 


Pro 


Phe 


Asn 


Ala 




610 










615 










620 










Glu 


Gly 


He 


Val 


Thr 


Lys 


He 


Phe 


Thr 


Asp 


Ala 


Asn 


Gly 


Thr 


Gin 


His 


625 










630 










635 










640 


lie 


Gly 


Leu 


His 


Pro 


He 


Pro 


Asp 


Arg 


Ser 


Gly 


Leu 


Trp 


Arg 


Tyr 


Leu 










645 










650 










655 




Ser 


Thr 


Thr 


Leu 


Leu 


Leu 


Leu 


Thr 


Met 


Leu 


Gly 


Ser 


Ala 


He 


Tyr 


Asn 








660 










665 










670 






Gly 


Val 


Gin 


Ala 


Trp 


Arg 


Arg 


Tyr 


Gin 


Arg 


His 


Arg 


Thr 


Arg 


Met 


Met 






675 










680 










685 








Glu 


He 


Gin 


Ala 


Tyr 


Tyr 


Glu 


Ser 


Cvs 
-* 


Leu 


Asn 


Pro 


Gin 


Leu 


He 


Thr 




690 










695 










700 










Pro 


Ser 


Glu 


Ser 


Leu 


He 


Glu 




















705 










710 






















<210> 457 




























<211> 237 




























<212> PRT 




























<213> Escherichia coli 






















<400> 457 




























Met 


Leu 


Pro 


Cys 


Arg 


Ala 


Asn 


Cys 


Phe 


Thr 


Leu 


Glu 


He 


Ser 


Leu 


Met 


1 








5 










10 










15 




His 


He 


Asn 


He 


Ala 


Trp 


Gin 


Asp 


Val 


Asp 


Thr 


Val 


Leu 


Leu 


Asp 


Met 








20 










25 










30 






Asp 


Gly 


Thr 


Leu 


Leu 


Asp 


Leu 


Ala 


Phe 


Asp 


Asn 


Tyr 


Phe 


Trp 


Gin 


Lys 






35 










40 










45 








Leu 


Val 


Pro 


Glu 


Thr 


Trp 


Gly 


Ala 


Lys 


Asn 


Gly 


Val 


Thr 


Pro 


Gin 


Glu 




50 










55 










60 










Ala 


Met 


Glu 


Tyr 


Met 


Arg 


Gin 


Gin 


Tyr 


His 


Asp 


Val 


Gin 


His 


Thr 


Leu 


65 










70 










75 










80 


Asn 


Trp 


Tyr 


Cys 


Leu 


Asp 


Tyr 


Trp 


Ser 


Glu 


Gin 


Leu 


Gly 


Leu 


Asp 


He 










85 










90 










95 




Cys 


Ala 


Met 


Thr 


Thr 


Glu 


Met 


Gly 


Pro 


Arg 


Ala 


Val 


Leu 


Arg 


Glu 


Asp 








100 










105 










110 






Thr 


He 


Pro 


Phe 


Leu 


Glu 


Ala 


Leu 


Lys 


Ala 


Ser 


Gly 


Lys 


Gin 


Arg 


He 






115 










120 










125 








Leu 


Leu 


Thr 


Asn 


Ala 


His 


Pro 


His 


Asn 


Leu 


Ala 


Val 


Lys 


Leu 


Glu 


His 




130 










135 










140 










Thr 


Gly 


Leu 


Asp 


Ala 


His 


Leu 


Asp 


Leu 


Leu 


Leu 


Ser 


Thr 


His 


Thr 


Phe 


145 










150 










155 










160 



WO 01/34810 



PCT/US00/30950 



Gly Tyr Pro 


Lys 


Glu Asp 


Gin Arg 


Lr€u i rp nib 


rixcl val 


A 1 


ulU nlo 






165 














x / j 


Thr Gly Leu 


Lys 


Ala 


Glu 


Arg Thr 


Leu Phe He 


Asp Asp 


Ser 


Glu Ala 




180 










185 




190 




lie Leu Asp Ala 


Ala 


Ala 


Gin 


Phe 


uiy lie /uy 




T.oii 


vjiy vai 


195 










200 










Thr Asn Pro 


Asp 


Ser 


Gly 


Tip 


Hid 






Arg 




210 














220 






Ser Leu Asn Asp 


Tyr Arg 


Arg 


Leu 


He Pro Ser 


Leu Met 






225 






230 














<210> 458 




















<211> 133 




















<212> PRT 




















<213> Escherichia coli 














<400> 458 




















Met Lys Glu 


Lys 


Pro 


Ala 


Val 


Glu 


val Arg Leu 


Asp Lys 


Trp 


Leu Trp 


1 




5 








1U 






ID 


Ala Ala Arg 


Phe 


Tyr 


Lys 


Thr 


Arg 


Ala Leu Ala 


A ^ Tit 

Arg biu 


Met 


lie Csiu 




20 










ZD 




^n 




Gly Gly Lys 


Val 


His 


Tyr 


Asn 


\jj.y 


Gin Arg Ser 


Lys Pro 


ber 


Lys He 


35 










a n 

fl u 










Val Glu Leu 


Asn 


Ala 


Thr 


T .on 


Thr 


Leu Arg Gin 


Gly Asn 


Asp 


Glu Arg 


50 








55 






fin 






Thr Val lie 


Val 


Lys 


Ala 


He 


Thr 


ulu ui.II rtX. y 


rtx y rLU 


rtJL d 


Car f21 
OCX Ui LI 


65 






70 






75 






80 


Ala Ala Leu 


Leu 


Tyr 


Glu 


Glu 


Thr 


Ala Glu Ser 


Val Glu 


Lys 


Arg Glu 






85 








90 






95 


Lys Met Ala 


Leu 


Ala 


Arg 


Lys 


Leu 


Son Al a T.£*n 

rV3 1 1 rtX a tic LL 


Thr Met 


Pro 


His Pro 


100 










105 




110 




Asp Arg Arg 


Pro 


Asp 


Lys 


Lys 


(Zl it 


r\L y rt i. y rta 


T.^ii Titan 


Arg 


L. 1 IC J-lJfO 


115 










120 




125 






His Gly Asp 


Ser 


Glu 
















130 




















<210> 459 




















<211> 294 




















<212> PRT 




















<213> Escherichia coli 














<400> 459 




















Met He Met 


Pro 


Gin 


His 


Asp Gin 


Leu his Arg 


lyr Leu 


trie 


1 Ann 

uiu Asn 


1 




5 








i n 

1U 






i ^ 
i j 


Phe Ala Val 


Arg 


Gly 


Glu 


Leu 


Val 


TKv Ual C Qr 

inr vai oer 


Pin Thr 

lj±u inr 


Leu 


bin win 




20 
















He Leu Glu 


Asn 


His 


Asp 


Tyr 


Pro 


uln irro vai 


T i;e Ann 

jjys Han 


Val 


L6U Ala 


35 










40 










Glu Leu Leu 


Val 


Ala 


Thr 


Ser 


Leu 


T on T 1 V"i t" Al -\ 

LcU inr nla 


Thr Leu 


Lys 


Dh a A c n 

irne risp 


50 








55 












Gly Asp He 


Thr 


Val 


Gin 


Leu 


Gin 


vjxy rto^j vjxy 


p-r~o Mot* 

riv lit; u 


rtO 11 


T.^n Al a 


65 






70 






75 






80 


Val He Asn 


Gly 


Asn 


Asn 


Asn 


Gin 


olll ClcL rtx y 


vjx y vox 


Al a 


Ara Val 
rtx y v ci x 






85 








90 






95 


Gin Gly Glu 


He 


Pro 


Glu 


Asn 


Ala 


Asp Leu Lys 


Thr Leu 


val 


Gly Asn 


100 










105 




110 




Gly Tyr Val 


Val 


He 


Thr 


He 


Thr 


Pro Ser Glu 


Gly Glu 


Arg 


Tyr Gin 


115 










120 




125 






Gly Val Val 


Gly 


Leu 


Glu 


Gly Asp 


Thr Leu Ala 


Ala Cys 


Leu 


Glu Asp 


130 








135 






140 







-411- 



WO 01/34810 



PCT/USOO/30950 



Tyr 


Phe 


Met 


Arg 


Ser 


Glu 


Gin 


Leu 


145 










150 






Gly 


Asp 


Val 


Asp 


Gly 


Lys 


Pro 


Ala 










165 








Met 


Pro 


Ala 


Gin 


Asn 


Ala 


Gin 


Gin 








180 










Leu 


Thr 


Glu 


Thr 


He 


Lys 


Thr 


Glu 
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295 








300 








Ser Lys 


Pro 


Ser 


Thr 


He 


Phe 
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Although claims 129-131 are directed to a method of treatment of the 
human/animal body, the search has been carried out and based on the 
alleged effects of the compound/composition. 
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Present claims 36, 44, 57, 95, 98, 1G9 and 117 relate to a compound 
defined by reference to a desirable characteristic or property, namely 
being identifiable by using the method of claims 28, 38, 45, 84, 96, 99, 
or 110, respectively. Present claims 125 and 126 relate to a compound 
defined by reference to a desirable characteristic or property, namely 
interacting with a gene or gene product or a polypeptide whose activity 
or expression is inhibited by an anti sense nucleic acid comprising one of 
SEQ ID NOS 1-127. 

The claims cover all compounds having this characteristic or property, 
whereas the application provides support within the meaning of Article 6 
PCT and/or disclosure within the meaning of Article 5 PCT for only a very 
limited number of such compounds. In the present case, the claims so lack 
support, and the application so lacks disclosure, that a meaningful 
search over the whole of the claimed scope is impossible. Independent of 
the above reasoning, the claims also lack clarity (Article 6 PCT). An 
attempt is made to define the compound by reference to a result to be 
achieved. Again, this lack of clarity in the present case is such as to 
render a meaningful search over the whole of the claimed scope 
impossible. Consequently, the search has been carried out for those parts 
of the claims which appear to be clear, supported and disclosed, namely 
those parts relating to the sequences of claims 1, 9 and 19 . 

The applicant's attention is drawn to the fact that claims, or parts of 
claims, relating to inventions in respect of which no international 
search report has been established need not be the subject of an 
international preliminary examination (Rule 66.1(e) PCT). The applicant 
is advised that the EPO policy when acting as an International 
Preliminary Examining Authority is normally not to carry out a 
preliminary examination on matter which has not been searched. This is 
the case irrespective of whether or not the claims are amended following 
receipt of the search report or during any Chapter II procedure. 
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This International Searching Authority found multiple (groups of) 
inventions in this international application, as follows: 

1. Claims: 1-8, 12-14, 45-124, 129-130 all partially 

Invention 1: 

A purified or isolated nucleic acid sequence, consisting of 
Seq Id No 1, a vector comprising said sequence, a host cell 
containing said vector, and their uses. 



2. Claims: 1-8, 12-14, 45-124, 129-130 all partially 
Inventions 2 to 127: 

Idem as invention 1, but for Seq Id Nos 2-127, respectively. 



3. Claims: 9-11, 15-44, 125-128 all partially 
Invention 128: 

A purified or isolated nucleic acid sequence consisting of 
Seq Id No 128, a vector comprising said sequence, a host 
cell containing said vector, a polypeptide encoded by said 
nucleic acid sequence and having Seq Id No 299, and an 
antibody binding said polypeptide, and their uses. 



4. Claims: 9-11, 15-44, 125-128 all partially 
Inventions 129 to 298: 

Idem as invention 128, but for nucleic acid Seq Id Nos 
129-298 and corresponding polypeptide Seq Id Nos 3G0-469, 
respectively. 
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